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Preface 


Over the past several years, I have taught a one-semester graduate 
course on gravitation at the University of Rochester. The principles 
of general relativity were developed systematically in a traditional 
manner, and this book comprises my lectures of this course. The 
class was attended by both graduate as well as several undergrad- 
uate students. Correspondingly, the subject matter was developed 
in detail using tensor analysis (rather than the compact differential 
geometric concepts). The concepts and ideas of the material were 
supplemented by a lot of worked out examples dispersed through 
out the lectures, which the students found extremely useful. 

Although it is the normal convention in a course on gravity to 
use a metric of signature (—,+,+,+), I have used through out the 
lectures the conventional Bjorken-Drell metric signature (+, —,—, —) 
for consistency with my other books. Since the book consists of my 
actual lectures in the course, it is informal with step by step deriva- 
tion of every result which is quite helpful to students, particularly to 
undergraduate students. 

There are many references to the material covered in this course 
and it is impractical to give all the references. Instead, let me note 
only the few following books on the subject which can be used as 
references and which contain further references on the subject. 


1. Gravitation and Cosmology, S. Weinberg, John Wiley (1972), 
New York. 


2. Gravitation, C. W. Misner, K. S. Thorne and J. A. Wheeler, 
W. H. Freeman (1973), San Francisco. 


3. Introduction to General Relativity, R. Adler, M. Bazin and M. 
Schiffer, McGraw-Hill (1975), New York. 


The figures in this book were drawn using Jaxodraw as well as 
PSTricks. I am grateful to the people who developed these extremely 


vii 


vill PREFACE 


useful software. Finally, I would like to thank Fernando Méndez for 
his help with some of the figures and Dave Munson for sorting out 
various computer related problems. 


Ashok Das 
Rochester 
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CHAPTER 1 


Basics of geometry and relativity 


1.1 Two dimensional geometry 


We have all heard of the statement that “Gravitation is the study of 
geometry”. Let us see what this exactly means. We are, of course, 
familiar with Euclidean geometry. For simplicity we will restrict here 
only to two dimensions so that the study of geometry of the man- 
ifold corresponds to the study of surfaces and Euclidean geometry 
corresponds simply to the all familiar plane geometry. 

A geometry, of course, has built in assumptions. For example, 
one has to define the basic elements (concepts) of geometry such as 
points. Furthermore, we know that given any two points we can draw 
many curves joining them. However, the straight line, as we know, 
defines the shortest path between two points in Euclidean geometry. 
Euclid’s axioms, among other things, state that a straight line is 
determined by two points, it may have indefinite length, that any 
two right angles are equal, that only one line parallel to a given line 
can pass through a point outside the line in the same plane, that 
when equals (equal line segments) are added to equals, the sums are 
always equal and that the whole is greater than any of its parts. 
From these axioms Euclid deduced hundreds of theorems which tell 
us a lot about Euclidean geometry. 

For example, let us consider a triangle ABC in plane geometry 
shown in Fig. 1.1. Through the vertex A let us draw a straight line 
DAE parallel to BC. From the properties of the parallel lines we 
immediately conclude that (DAB = Angle DAB) 


Angle DAB = Angle ABC, or, DAB = ABC, (1.1) 


and 
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Figure 1.1: A triangle in plane geometry. 


Angle EAC = Angle ACB, or, EAC = ACB. (1.2) 


On the other hand, we also know that 


DAB + BAC + EAC = 180°. (1.3) 


Thus, we conclude that 


DAB + BAC + EAC = ABC + BAC + ACB = 180°. (1.4) 


This, of course, proves the familiar result that the sum of the angles 
of a triangle on a plane equals 180°. Similarly many more interesting 
results can be obtained by using Euclid’s axioms. 

However, it is Euclid’s axiom on the parallel lines that generated 
much interest. It says, “If a straight line falling on two straight lines 
makes the interior angles on the same side less than two right angles, 
the two straight lines produced indefinitely meet on that side on 
which the angles are less than two right angles”, which is shown in 
Fig. 1.2. Since this axiom involves extending straight lines to the 
inexplorable region of spatial infinity, people tried hard to prove that 
this follows from the other axioms of Euclid and that this is not truly 
an independent axiom. 
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Figure 1.2: Two straight lines in plane geometry which are not par- 
allel according to Euclid’s axiom. 


The study of this question, on the other hand, led to the birth 
of other, non-Euclidean geometries which satisfy all the axioms of 
Euclid except the one on parallel lines. These are geometries of 
curved surfaces where the definition of a straight line carries over 
to what we know as geodesics. (As we will see later, geodesics are 
paths along which the length is extremal.) One such geometry is 
the geometry of the surface of a sphere. On a sphere the geodesics 
(“straight lines”) are parts of great circles joining the opposite pole 
points. It is clear that since any two great circles must intersect (at 
the poles), on a sphere we cannot draw any line parallel to a given 
line through a point outside of the line. 


Another geometry which was developed independently by Gauss, 
Bolyai and Lobachevski is opposite in its properties to the previous 
one (namely, the geometry of a sphere). Therefore, it is also known 
as the geometry of the surface of a pseudosphere. It is rather hard 
to visualize. But a crude example of this geometry is the surface 
of a horse saddle. On such a surface, many geodesics can be drawn 
through a given point not intersecting a given line outside of this 
point and hence in this geometry, it is possible to draw many lines 
parallel to a given line through a point outside of the line. These are 
the three possible distinct geometries we can have. 


In a geometry described by curved surfaces it is no longer true 
that the sum of all angles in a triangle equals 180°. In fact, one 
can simply show that for a triangle on a sphere the sum of angles 
is greater than 180° whereas for the geometry of Gauss, Bolyai and 
Lobachevski, it is less than 180°. Construction of these geometries 
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showed that the fifth postulate of Euclid is indeed independent. 

A question we can ask at this point is what is the geometry of 
the space we live in. Is it flat, or is it like a sphere or is it like a 
pseudosphere? If we were two dimensional creatures living in a two 
dimensional space, we can simply study the properties of geometrical 
figures and deduce the nature of the geometry ourselves. But this 
does not seem right. There ought to exist a general quantity which 
distinguishes one geometry from the others. It must be an inner 
property of the surface (or the manifold) and let us determine this. 

First of all let us note that although a straight line is defined 
as the shortest path between two points, we still have not defined 
the concept of a distance. In plane geometry if two points have 
coordinates (1,22) and (yi,y2), then we know that the shortest 
distance between the two is given by 


1 


d(x,y) = (x1 —y1)" + (x2 —yo)*|*. (1.5) 


Furthermore, it was shown that for the geometry of Gauss, Bolyai 
and Lobachevski, a point with coordinates (x1, x2) satisfies (suitably 
scaled so that the coordinates are dimensionless) 


at +a% <1, (1.6) 


and the shortest distance between two points (x1, 72) and (y1, y2) is 
given by 


d 1- — 
cosh HOY) try say (1.7) 


2 (l-a} a8)? -v-08)? 
669 


Here “a” is a length scale set by the problem under study. 

Gauss realized that the distance d(x, y) which gives the short- 
est path between two points determines the inner properties of a 
space. Thus for example, a cylinder or a cone has the same inner 
properties as a plane since a plane can be rolled into a cylinder or 
a cone without stretching or tearing. (The formula for the distance 
on a cylinder or a cone is the same as on a plane.) However, as we 
all know, a sphere cannot be unrolled into a plane without tearing 
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it and correspondingly the formula for the distance on a sphere is 
different. 

Gauss also recognized that any space can be locally identified 
with a Euclidean space. Hence locally we can set up a Euclidean co- 
ordinate system €;,7 = 1,2, to describe the space so that the infinites- 
imal distance between the points (£1, €2) and (€; + d&, 2 + dé) can 
be written as 


ds? = dé? + dé. (1.8) 


Of course, if the surface is curved, this coordinate system cannot 
cover any finite space still satisfying the laws of plane geometry. Let 
us suppose that there exists a coordinate system (21, 72) which covers 
the entire surface. Then we can write the distance between two points 
with the coordinates (271, x2) and (x, + day, x2 + dx2) (which locally 
coincides with (1,2) and (€ + d&1, 2 + d&2)) as 


ds? = dé?+dé 
7 a of : oe oes . 


0€, \? Of&2\"|. 5 
( ' 3) ao 


Of) Of) | 0&2 Oks 
7 2 |e Ox Te. Ox, Ox2 Fe) ay a 


06: \7 (Aba \? 


= gii(x)da? + 2912(x)day dag + goo(x)dx5 
= gijdxjide;, (1.9) 


where we have identified 
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ra) 2 2 

aes” , ( O62)” 

Ox1 Ox} 

Gy a EE EO ES in 
a? Ox Oxe Ox} Oxe gt : 


dg, \? de, \? 
(SE) + (2). a 


The form of ds? in (1.9) is the sign of a metric space and 
9ij,1,J = 1,2 represents the metric tensor of the two dimensional 
space. Thus, for example, the metric for a plane where 


gu (2) 


g22(x) 


ds? = dx? + dz, (hit) 
is given by 
gu = 922 =1, g2= 0. (1.12) 


For a sphere of fixed radius r, on the other hand, 


ds? = r? (dé? + sin? dd”) (1.13) 


Thus, in this case, the metric takes the form 


goo = 1", 906 =9, 966 = r? sin? 6. (1.14) 


It is this dependence of the metric on the coordinate 0 that gives the 
sphere different inner properties from that of a plane. We can also 
show that for the geometry of Gauss, Bolyai and Lobachevski, the 
metric is given by 


a? (1 — x3) 
Crea 
g(t) = _ eae, 
(1— 2} — 23)” 
2 i= 2 
g2(x) = ea Cae (1.15) 


(1 — 2} — 23) 
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Once we know the metric functions, the length of a path can be 
obtained by integrating ds along that path. Thus the metric func- 
tions determine all the inner properties of a space. Unfortunately, 
however, they are also coordinate dependent. Note that if we pa- 
rameterize a plane with Cartesian coordinates as in (1.12), then we 
have 


gu =922=1, and gi2=0. (1.16) 


However, if we parameterize the same surface with polar coordinates, 
then we can write 


ds? = dr? + r7d6”, (1.17) 
so that 
Grr = 1, 906 = r, and Gre = 0. (1.18) 


This does not look like an Euclidean space (the metric component 
geo is a function of coordinates), but it really is the same space. This 
example shows that although the metric functions determine all the 
inner properties of a space, since their forms depend on the particular 
coordinate system used, they are not very useful in classifying spaces. 
We should look for an object constructed from g;; that does not 
depend on the choice of coordinate system. 

In fact, Gauss had found a unique function (in two dimensions) 
that does this. It is called the Gaussian curvature and is defined (for 
two dimensional spaces) as 


~ 2g Ox10r Ox? - Ox 


— Gir 0922 5 IM2 = Ogu - Og22\" 
4g? Ox Ox} Oxe Ox, 
2 


_ G22 Ogu 2 Og2  Og22\ _ ( Ogi 
4g? Ox, 0x2 Ox} Ox 


K( ~ | A? g12 Ogi aad 
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4 G12 Ogi Og22 6 Ogi1 Og22 
4g? Ox, 0x2 Oxo Ox} 


Ogi2 Ogi O9gi2 Og22 
2— - 2—- - i 
( Ox, 0x2 ) ( 0x9 Ox, )] 


(1.19) 
Here g represents the determinant of the two dimensional metric 9;;, 
namely, 
= = 2 
g = det gij = 911922 — 92: (1.20) 


It is clear from the expressions in (1.19) as well as (1.16) that 
the Gaussian curvature vanishes for a plane surface (geometry), 


K =0. (1.21) 


For a sphere, on the other hand, we note from (1.14) that (we can 
identify x1 = 0,22 = $, 911 = 900; 922 = 946s 912 = 96) 


— rt gin2 
g=r sin’ 6, 


2900 _ 2900 _ 296s 
00 Od Ob ; 


“a0 = 2r* sin cos 6 = r’ sin 20, 
2 
7 te = 2r2 cos 20. (1.22) 


Thus, for a sphere we obtain 


x = L(_@s00) — 900 (_ ( 2900” 
2g 062 4g? 00 


1 r2 
= ——W (_-2r 56) == —_— | = 7 ain? 00 
2r4 sin? 0 ( ee ) 4r8 sin 6 ( ) 

1 1 


= -—~—, (1-2sin?6) + 


BY (4 sin” 0 cos 0) 
r? sin 


Ar? sin* 0 
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1 
= “ay (1 — 2sin? 8 — cos? 0) 
r* sin 
1 ad 1 


That is, the surface of a sphere is a space of constant (since r is a 
fixed constant) positive curvature. Similarly, one can show that the 
surface of a pseudosphere has a constant negative curvature, 


k= -5 =U: (1.24) 

We can check that the Gaussian curvature is independent of 

the coordinate system used by simply calculating the curvature of a 

plane in polar coordinates. Let us note from (1.18) that, in this case, 
(we can identify x1 = 1,22 = 9,911 = Grr; 922 = 900, 912 = Gro) 


g=r", 

OGrr OGrr Ogee 0 

Or 006 00 , 

0960 

=? 

Or e 

3D” G00 
— os 1.2 
Or? 22) 


Thus, in these coordinates 


_ Lf 8oe0 Grr Ogee \” 
ae rll alee =e) 


= -Z+5 =), (1.26) 


Thus we see that if we can calculate the Gaussian curvature of a 
surface, we can distinguish between the three geometries independent 
of the choice of coordinate system. 
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So far we have been dealing with only two dimensions and hence 
there is only one quantity independent of the coordinate system used. 
This can be intuitively seen from the fact that since the metric is 
symmetric, in two dimensions, there can only be three independent 
metric components. There are, however, two coordinates to represent 
any system. Hence the number of quantities that can be formed 
which are independent of the coordinates used is exactly one (namely, 
3 —2=1). In higher dimensions, say D dimensions, the number of 
independent metric components is 


5 D(D +1). (1.27) 


However, since there are D coordinates to specify a system, the num- 
ber of independent quantities that can be constructed which are in- 
dependent of the coordinate system used is 


5; DD +1)-D=5 D(D-1). (1.28) 


As a result, the study of geometry becomes more complicated when 
D > 2. This general problem was solved by Riemann and was ex- 
tended to physics by Einstein. 


> Example (Pseudosphere in two dimensions). For the pseudosphere in two di- 
mensions we have (see (1.7)) 


d 1- _ 
os) eee 0 2) (1.29) 


a (l= at-2})* 1-3 -)? 
where d(x,y) denotes the distance between the points (#1, v2) and (y1,y2). We 
would like to determine the metric as well as the Gaussian curvature for this 
space. 
For any two vectors A, B in two dimensions, let us introduce the compact 
notation 


A-B=A;,B, = AiBi + AoB2, A? = Aj A; = A Ai + Az Ad, (1.30) 
where 7 = 1,2 and we use the summation convention over repeated indices. Then 


we can write (1.29) as 


epg OD) = (1.31) 
a 


Via /iaae 
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To determine the metric tensor for this space, we need to consider only in- 
finitesimal distances (see discussion following (1.7)) so that let us assume y; = 
x; + dax;,i = 1,2 and we can identify the infinitesimal distance with the proper 
length d(x,y) = ds. Expanding both sides of the equation (1.31) and keeping 
terms to quadratic order in the infinitesimal changes, we obtain 


2 1-22 8 1-22 


a-dx 1 2a - da + dx? 3 (2x-dax + dz? 
(- $8) SE ae oo 


1 da? 1 (a- dx)? 
21-2? 15 (fa gee 2) 


eae), | pee (ea 


=1+ 


where da” = dx- dx. Note that the linear terms (in the differentials) on the right 
hand side of (1.32) in the expansion for the distance cancel and the quadratic 
terms lead to 


ds* 1, dz? zs 1 (a- dx)? 
2Qa2- 21-22 © 2(1—2?)2 
1 L4X; 
= —— ( 6s + — J daidz; 1. 
waa (5 + pda) ars, “en 
where i,7 = 1,2 and summation over repeated indices is understood. This is 


known as the line element for the space and determines the metric components 
for the pseudosphere to be 


2 2 2 2 2 
a“ (1 — 23) L1 X20 a“(1— 27) 
= — = = = ——_——. 1.34 
gil ‘al a 2)? ’ gi2 g21 a = x2)?’ g22 al _ x2)? ( ) 
Starting with the components of the metric tensor in (1.34), we obtain the 
determinant of the metric to be (see (1.20)) 


9 = 911.922 — gh 


at 


Gaal 


1— #})(1 — #3) — aia3] 


at 


= —_.... 1.35 

aa (1.35) 

Next let us evaluate the three expressions involved in the calculation of the 
Gaussian curvature (see (1.19)). For simplicity, let us define A, Biz, and C as 
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2a? 8a? x? 48a? x2.23 
d-2 'O-# "Te 


2a? 4a (1—5a3) — 24a?w3(1 — 23) 
d-2P aa a2 
2a? 4a®(1—Sat) 240? wi(1 — 2?) 
G-eP a —ap a2 
2a? A '(1 + x”) (1.36) 
“7-2? 0-2 0-2’ 
By, — 2922 (9092 Agu) _ ( Age2)” 
nos Ox2 Ox, Ox2 Ox 
= 4a? xvo(1 - x7) 2a?x2 8a7 2722 2a? x2 
a (1 = x2)3 (1 = x2)2 (1 = x?)3 (1 = x2)2 
— 4a?x9(1 — 23) rics 2a7x1 4a?x1(1 — x7) . 
c-2 (-ept a-2, 
dat x? 
=- (1.37) 
_ 9911 Og22 Agr Og22 Ogi2 — Ogir Ogi2 _ Og22 
oe Ox, Ox. ; 0x2 Ox4 Tle Or, = 0X2 : Ot. = Oa, 
_ 4a?x1(1 — £3) 4a7x2(1 — 27) 
““d-2 2) 
2a x2 4a? ro(1 = £5) 
2a a] eo 
2a? x2 8072.03.02? 2a2x2 Aa?xo(1 = x3) 
+ eet aaa coe ay | oo 
8ata12x2 
= Aetna, (1.38) 


Substituting (1.36)-(1.38) into (1.19), we obtain the Gaussian curvature 
to be 


1 gi 922 912 
K=—A-=-= —_-==B — 
2g 4g? 12 4g? 21+ 4g? 
xt al - 2a7(1 + — 
ee | C= ee 
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al — Aa® ai (1 — £3) 7 


Tes 


(1 © 2) 


xyr2a~ 4 
(-#2)2 8a 2122 


1 
= (142? ry(1 r3) r3(1 r}) 2air5) 


a2 
ae 1 


which coincides with (1.24). 


1.2 Inertial and gravitational masses 


Newton’s second law of motion says that any force applied on a 
particle is proportional to the acceleration it produces. Hence we 
can write 


This refers to the inertial properties of the system since the force can 
be any force in general. However, when a particle is subjected to a 
gravitational pull, it experiences a force given by 


F = meg, (1.41) 


where g represents the acceleration due to gravity. Here m; and mg 
are constants of proportionality known respectively as the inertial 
and the gravitational mass of the particle and can, in principle, be 
different. However, if we combine both the equations (1.40) and 
(1.41), we deduce that the acceleration produced by a gravitational 
force has the form 


a= (“#) g. (1.42) 


If the inertial mass does not coincide with the gravitational 
mass (m; # mg) and a“ is not the same for different particles, then 
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clearly different objects can fall with different accelerations under 
the influence of gravity, the pendula of the same length but made 
of different material can have different time periods. On the other 
hand, if ae is the same for all materials, they would fall with the 
same acceleration. Newton tried to measure this ratio with pendula 
of different material and Fresnel improved on it. It has been checked 


by now that (in fact, it holds even up to 107%) 


es = 1 £10", (1.43) 
I 
for different materials. 

Let us examine the experimental set up which measures this ra- 
tio. The experiment was designed by Edtvos of Hungary and consists 
of suspending two weights of different material from a finely balanced 
beam as shown in Fig. 1.3. Here g’ and gf represent the vertical and 
the horizontal components of the centrifugal acceleration due to the 
rotation of earth as shown in Fig. 1.4 (gf is along the surface). 


mag’ mipg 


el; i) 
MIAGs MIBGs 


Mag MeBg 


Figure 1.3: Experimental setup for measuring a 


The beam is in equilibrium so that we have 


£a (mga g— ™Ma 7) = ly (Men g— mpg ) . (1.44) 


Therefore, we can solve for ¢, from (1.44) to obtain 


(Mga g — M1 g') 


(m,n g — My 7) 
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Figure 1.4: Vertical and horizontal components of the centrifugal 
acceleration. 


On the other hand, the horizontal component of the centrifugal ac- 
celeration would lead to a torque 


T lama % _ lgmipgy 


m — mao’ 
ay (aime _g, Meng — ma 9) me) 


7 
Mea J — Mp J 


exgl ne = ae g, (1.46) 


7 
Men J — Mp J 


where we have used (1.45). Furthermore, if we assume that the 
vertical component of the centrifugal acceleration is much smaller 
than the acceleration due to gravity (g' < g), we can write 


T = £4.95 (ms = a 
MB 
m m 
ae MerlaG, ( = = =) . (1.47) 
Mega Mop 
Any inequality in the ratio 7 for the two weights would lead to 


a twist in the wire from which the beam is suspended. No twist 
was observed and E6tvés concluded from this that the difference of 
(or conversely mt) from unity can be at most one part in 10°. 
Dicke and his coworkers later improved the experiment and the limit 
we presently have is one part in 101! (or even 10!%). This is, of 


course, an impressive result because a priori we do not know why 
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the gravitational mass and the inertial mass should be equal to such 
a high precision. (We note that if ae is a constant for all materials, 
then this constant can be absorbed into the definition of Newton’s 


constant and we can identify me = 1.) 


1.3 Relativity 


The Aristotelian view, the Galilean view and the modern view of 
relativity have one point in common. Each assumes that space-time 
is a collection of events and that each event is labelled by three spatial 
coordinates and one time coordinate. That is to say, the space-time 
manifold is a four dimensional continuum. However, beyond this the 
different views differ drastically from one another. 

For example, in the Aristotelian view, events take place at ab- 
solute space and time coordinates and three different kinds of events 
can take place in such a manifold. An Aristotelian observer would 
observe an event A to precede event B or follow event B or happen 
simultaneously with event B. All events that happen simultaneously 
can happen anywhere in space and hence define a three dimensional 
hypersurface. Therefore, the causal structure of space-time is de- 
scribed as in Fig. 1.5. 


future 


simultaneous events 


past 


Figure 1.5: Causal structure of space-time in Aristotelian and 
Galilean relativity. 


On the other hand, if we look at Newton’s law of motion for a 
particle under the influence of gravitational forces, the equation has 
the form 
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d?x mmy (Xv — xX) 
es > ae (1.48) 
where we are assuming that the gravitational force is due to a dis- 
tribution of point masses in space with x, representing the location 
of the mass my. We see that Newton’s law selects out a family of 
inertial frames of reference. For example, let us consider the trans- 
formation 


x — Rx+vt+a, det R= 1, 


t > t4+T, (1.49) 


where R is a rotation matrix (constant, independent of x,t), v, a 
and T are constants. Clearly, then, under such a transformation, 


d?x d?x 
ag “ap? 
(xv — xX) R (xy — x) 


(1.50) 


——— - —. 
[xv — x8 [xv — x]? 


Therefore, Newton’s equation (1.48) remains invariant under the 
transformation (1.49) which is known as the Galilean transforma- 
tion. It is a ten parameter family (group) of transformations (three 
parameters of rotation, three velocity parameters v, three param- 
eters of translation a for the coordinates and one time translation 
parameter JT’) and Newton’s equations are invariant under Galilean 
transformations. We note that Newton’s equation would not be in- 
variant if we go to a rotating or linearly accelerating frame, i.e., if R 
and v are time dependent. 

Galilean transformations select out a family of inertial frames 
in which Newton’s equations (laws) are valid. An observer in such a 
frame would observe an event A to precede an event B or follow B or 
happen simultaneously with B. In this sense it is the same as the view 
of the Aristotelian observer and the causal structure of space-time 
still remains the same as in Fig. 1.5. However, unlike the Aristotelian 
observer, no two Galilean observers in two different inertial frames 
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would agree on the exact position or the time of an event. The only 
thing that they can agree on is the time interval between two events 
and the space interval between them. Furthermore, we note that in 
the Galilean view neither the concept of a constant absolute speed 
nor the concept of a limiting speed exists (namely, frames can move 
with any arbitrary velocity v). 

According to Newton the view of space-time was the following. 
He believed that there is an absolute fixed space-time and the inertial 
forces are due to the frames of reference accelerating with respect to 
the absolute space-time. He tried to justify his view through the 
example of a rotating bucket of water. A great number of arguments 
followed and the one that stands out in particular is known as Mach’s 
principle. This says that the acceleration experienced by a particle is 
influenced by the distribution of matter in space-time. That is, the 
presence of big stars would change the acceleration experienced by a 
particle. We can contrast this with the Newtonian view which would 
say that the presence of big stars does not matter to the acceleration 
of a particle. We will see later during the course of these lectures 
how Einstein’s theory reconciles both these ideas. 

The Newtonian theory along with Galilean relativity, namely 
that the relative separation of events in space and time are meaning- 
ful, met with a large number of success. The place where theoretical 
prediction disagreed with observation was in the precession of the 
perihelion of mercury. The theoretical calculations using Newton’s 
equations yielded a value of 8” /century for the value of the preces- 
sion in contrast to the observed value of 43”/century. (We will do 
this calculation later using Einstein’s general relativity.) 

We note here only two other phenomena that would not conform 
to the views of Galilean relativity (the first was observed much later). 
We experimentally observe that : mesons decay at rest in the labo- 
ratory with a lifetime 7, = 10~° sec. On the other hand, 4. mesons 
are also produced in cosmic ray showers. And it was measured that 
the cosmic ray muons have a lifetime about ten times longer than 
the measured lifetime in the laboratory. Various explanations were 
put forward to explain this, one of which is that the muon lifetime 
changes with the speed of the particle. This was indeed verified in 
the laboratory by producing muons at various speeds. However, this 
poses a serious problem to Galilean relativity since we assume that 
observers in different inertial frames have clocks which read the same 
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time intervals and hence can be compared. The 4. meson example, 
on the other hand, shows that if we regard the muon to be a clock 
(in the sense that it defines a time interval of 10~° sec) then the time 
reading depends on whether the muon is at rest or is in motion. 

The second problem comes from Maxwell’s equations which, in 
vacuum (without sources), take the form 


V-E=0, 
V-B=0, 


1 OB 
VxE=-- —, 
c Ot 

1 OE 
VxB=- —. 1.51 
C 70r en) 
First of all it is straightforward to check that these equations are no 
longer invariant under the Galilean transformations (1.49). Secondly, 


equations (1.51) lead to 


10 
Vx(VXxE)=—~ 5 (VW xB), 
Dippese 2. (Ve 
or, V(V-E)—-VWE= eo hn es 
1 PE 
or, WE=3 37° (1.52) 


This predicts that electromagnetic waves propagate with the con- 
stant speed of light c. A constant absolute speed is, of course, against 
the spirit of Galilean relativity and hence Maxwell postulated that 
electromagnetic waves propagate through a medium called ether. 
That is, one selects out a unique frame of reference to define the 
speed of light. However, experiments failed to detect ether and the 
Michelson-Morley experiment was decisive in establishing that there 
was no ether and that the speed of light nonetheless is an absolute 
constant. 

These conflicts, among other issues, led people to reevaluate the 
Galilean view and Einstein proposed to replace the requirement. of 
invariance of physical laws under Galilean transformations by that 
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under Lorentz transformations. This goes under the name of special 
theory of relativity. The first step in this direction and a great one 
is to realize that space and time should be treated on equal footing. 
Thus one defines a four dimensional space-time manifold as well as 
a coordinate four vector (four dimensional vector) on this manifold 
of the form 


gh (eles = 0.1.2 8: (1.53) 


The length of a vector on this manifold is defined as 


eo = hee”, (1.54) 


where 1),, is the metric of the four dimensional space known as the 
Minkowski space. The metric is symmetric and diagonal with com- 
ponents 


0, if py, 


no = —n = 1, i=1,2,3, i not summed. (1.55) 


Uy 


We can define the inverse metric n“” from the relation 


1 Nyy = OF. (1.56) 


It is easy to see that the inverse metric also has the form 


fw = 0, if wy, 


no = -n® =1, i not summed. (1.57) 


Note that we can raise and lower the index of a given four vector 
with the help of the metric as 


=! V 
Ty = Muvrt , 


Oe! a eas (1.58) 
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where repeated indices are summed and this leads to 


Lp = (ct, —x). (1.59) 


The two four vectors x, x, are known respectively as contravariant 
and covariant four vectors and the length of the coordinate vector 
(1.54) takes the form 


Pai oa =e =a ae (1.60) 


Clearly 2? must be a four dimensional scalar since it does not 
carry any index. (We note that unlike the length of a vector in the 
Euclidean space, here the length need not be positive since it involves 
a difference of two terms.) 

We can ask for the form of the four dimensional transformation 
which leaves this length invariant. It is clear that if 


ct o/h = AM x’, (1.61) 


then the length remains invariant under this transformation only if 


12 / Iv 2 Vv 
Ce Syl a. Sa Sage 
Vy (on V 
or, Nv AM, xP A Qo” = gel 2": 
yo __ 
or, Nv AMA o = Noo: (1.62) 


This has the same form as ATjA = 7 in matrix notation (the su- 
perscript T denotes matrix transposition) which defines rotations 
in this manifold. Therefore, we conclude that rotations in the four 
dimensional space-time manifold which leave the length invariant 
also leave the metric invariant. These are known as homogeneous 
Lorentz transformations (they consist of three dimensional rotations 
and boosts). 

However, in addition to four dimensional rotations if we also 
translate the coordinate axes by a constant amount, we obtain the in- 
homogeneous Lorentz transformations (also known as Poincaré trans- 
formations) under which 
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ch > g/h = AM ge’ +a". (1.63) 


The infinitesimal length interval between two points (as well as the 
finite length interval) remains invariant under this transformation, 
namely, 


ds? = nda de => ds’? = node da” 
= Nw“ dxPAY,dx? 
= MwA" AY,daxPdx? 
= odaxPdax’ = ds* = c’dt? —dx*. (1.64) 


Here ds is known as the “proper length” of the interval. As 
we know, Lorentz transformations mix up space and time coordi- 
nates and hence neither a space interval nor a time interval remains 
invariant under these transformations. However, it is the particular 
combination of the space and time interval given by the proper length 
in (1.64) which is invariant under an inhomogeneous Lorentz trans- 
formation (Poincaré transformation). (We can contrast this with the 
Galilean view according to which different observers will agree on 
both space and time intervals.) 

For a light wave, the wave velocity is given by || = c and 
hence the proper length for the path of a light wave (photon) follows 
from (1.64) to vanish 


ds = 0. (1.65) 


If we go to a different inertial frame the proper length does not change 
so that for a light wave we still have 


de =O: (1.66) 


Hence in a different Lorentz frame the speed of light would still be 
equal to c. That is, Lorentz transformations can accommodate a 
constant absolute speed and the speed of light can be the same ir- 
respective of the inertial frame of reference we choose. This is, of 
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course, the content of the Michelson-Morley experiment. Clearly 
since this represents a departure from the Galilean view, the causal 
structure of space-time must be different. Indeed, in this framework, 
we see that space-time can be divided into four disconnected regions, 
as shown in Fig. 1.6, of the following three distinct types: 


(cx-y)? =(a@-y)y(e—y)4 > 0, time-like, 
(c—y)? =0, light-like, 
(ay)? <0, space-like. 


(1.67) 


space-like 


Figure 1.6: Different invariant regions of Minkowski space. 


In the time-like regions, the propagation of any signal occurs 
with a speed less than the speed of light and hence such regions rep- 
resent causal regions. The light-like regions correspond to regions 
where signals travel at the speed of light. Finally, in the space-like 
regions the speed of propagation of any signal exceeds the speed of 
light and hence, in such regions, no two points can be causally con- 
nected. These three types of regions cannot be mapped into one 
another through a Lorentz transformation. The Einstein view of 
space-time, therefore, is that at every point in the manifold there 
exists a cone which corresponds to a causal future cone where the 
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time evolution of dynamical systems takes place. Furthermore, as 
is evident from the transformations themselves, the notion of simul- 
taneity does not exist in this manifold. Namely, simultaneity is not 
a Lorentz invariant concept. 

Note that the inhomogeneous Lorentz transformations (Poincaré 
transformations) also constitute a family (group) of ten parameter 
transformations (three rotations, three boosts and four space-time 
translations) — exactly like the Galilean transformations. Therefore, 
they do not represent a larger symmetry group. However, the nature 
of the transformations is different. In addition to three dimensional 
rotations Lorentz transformations also involve boosts. If we consider 
only the homogeneous Lorentz group we can further restrict ourselves 
to the proper Lorentz transformations by demanding that 


NG >, det A“, = 1, (1.68) 


representing orthochronous transformations. 

Let us examine how, with these ideas, we can explain the decay 
of the js mesons. Let us assume that an observer at rest observes the 
ticks between his clock to be with intervals dt = At. Therefore, he 
will calculate the proper length to be 


1 
1 2\ 2 
ds = (c?dt? — dx”)? = cdt (1 - =) =cdt= cg, (1.69) 


where we have used v = O for an observer at rest. However, if 
another observer traveling with a velocity v observes the ticks to be 
of interval dt/ = At’, then he will calculate the proper length to be 


1 
v2\2 
= Ce. (1-5) (1.70) 


However, since the proper length is an invariant quantity we have 
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ds’ 


ds, 


! v? 3 
or, At = At 1-4 = yAt, (1.71) 


where we have defined 


ie)” (7) 


This shows that the faster an observer is moving, the longer would 
be the time interval between the ticks of his clock. (The time in- 
terval in a clock depends on the speed with which it is moving.) 
This explains why the ~ mesons in cosmic rays have a longer life- 
time. The constant y is conventionally known as the Lorentz 
factor. 

We show next that Maxwell’s equations in vacuum without 
sources are covariant under Lorentz transformations (they are not 
invariant under Galilean transformations). But before doing that let 
us introduce the compact notation for four dimensional derivative 
four vectors (we are going to assume c = 1| from now on). The four 
dimensional covariant derivative is defined as 


i) 0 


which leads to the contravariant derivative 
O O 
fe = [| — , = 2 1.74 
eee ( v) a) 


Furthermore, we note that 


v o” 
0," = "8,0, = 55 -V? =O, (1.75) 


which is known as the D’Alembertian, is invariant under Lorentz 
transformations. 
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To show that Maxwell’s equations are Lorentz covariant, let us 
combine the electric and the magnetic fields into a Lorentz tensor in 
the following manner. Let us define a second rank anti-symmetric 
tensor F,,, = —F,, (it has six independent components in four di- 
mensions) with 


Foi = FE, = (Bs, ae 
and 
Fy = —€j~Be= —€ijk(B)x, 
1 
or, By = (B= =) €igkl jk ey) 


where €;;, denotes the three dimensional Levi-Civita tensor. With 
this identification, two of Maxwell’s equations in vacuum without 
sources (see (1.51)) can be written as 


OF, = 0. (1.78) 


To see this let us note that this actually reduces to two sets of equa- 
tions. Namely, for v = 0, we have (Fo9 = 0 by anti-symmetry) 


OFF ig = 0, 
or, OE = 0, 
or, (—-V)-(-E)=V-E = 0. (1.79) 


Similarly, for v = 7, we obtain 


OME = 0; 
or, O° Fo; + OF; = 0, 
or, i + O' (-€;.B,) = 0, 
or, —-VxB=0, 
or, VxB= oe (1.80) 


ot 
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These only represent two of the four equations of Maxwell in 
(1.51). To obtain the other two, we note that given the field strength 
tensor F’,,,, we can define the dual field strength tensor as 


~ 1 
» 
Pw = Ewrpl i 


; Ga Pps (1.81) 


1 
5 
Here €#”? (or Eup) defines the four dimensional Levi-Civita tensor 
with €°!?3 = 1. The dual field strength interchanges the roles of the 
electric and the magnetic fields and we have Fo; = — S6ijn Lik = B; 
and Bi; = €ijrE (where we have used the fact that Ik coincides 
with the three dimensional Levi-Civita tensor €;;,). The other two 


equations of Maxwell in (1.51) can now be written as 


OF 4 = 0. (1.82) 


Equation (1.82) can, in fact, be explicitly solved to determine 


Fv = OwAr = OvAn, (1.83) 


where A,, is known as the (four) vector potential. (It contains the 
three dimensional scalar and vector potentials as A“ = (¢,A).) 
Thus, the set of four Maxwell’s equations in (1.51) can be written 
either as 


HF y= 0= HF yy, (1.84) 


or equivalently as 


OF, =0, Fy =0,A,— OA, (1.85) 


Before we examine the Lorentz covariance of Maxwell’s equa- 
tions, let us discuss how various tensors transform under a Lorentz 
transformation. We already know how a (contravariant) four vector 
transforms (see (1.61)), namely, 


oY > Mn”, (1.86) 


28 1 BASICS OF GEOMETRY AND RELATIVITY 


This immediately tells us that under a Lorentz transformation, a 
second rank contravariant tensor U“” would transform as 


ue > MAYO, (1.87) 


Similarly any higher rank contravariant tensor would transform as 


eee =. MM My MAN aaa eee (1.88) 


This is interesting because we can now ask what information can 
be derived from the transformation property of the invariant (con- 
travariant) metric tensor, namely, 


nv —» NEN a? (1.89) 


Since the metric is an invariant tensor, it does not change under a 
Lorentz transformation. (Transformations which leave the metric of 
a manifold invariant are also known as isometries of the manifold and 
Lorentz transformations define isometries of the Minkowski space- 
time.) This implies 


ni’ —» Mae Ses (1.90) 


which is what we had seen earlier in (1.62). Alternatively, we can 
write this as 


AP kee =n” or, equivalently APY 7 ae (1.91) 


Similarly, the inverse (covariant) metric 7, is also an invariant tensor 
and one can show that this leads to (this also follows from (1.91) by 
lowering the indices with the metric tensor) 


AyrA) =Nwv, Or, equivalently Nh =a (1.92) 


Once we know the transformation properties of contravariant 
tensors as well as the fact that the metric tensor is invariant, then 
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we can calculate how the lower index (covariant) objects transform 
under a Lorentz transformation. For example, 


= 2, Vv rv 
Lu = Nw h > NwA’yx 


= hoe ae. (1.93) 


Remembering the relationship (see (1.91)) 


NAY = nh”, 
or, AXA = 64, or, equivalently A,“A*,= 64, (1.94) 


we see that, under a Lorentz transformation, the lower index (covari- 
ant) vectors transform in an inverse way compared to the upper index 
(contravariant) vectors. Therefore, we distinguish between them by 
saying that 


Ag contravariant vector, 


U RYN. 


— 

Ly —> covariant vector, 
—  contravariant tensor, 
— 


Si ae covariant tensor. (1.95) 
We are now in a position to calculate how a covariant tensor 
transforms under a Lorentz transformation 


Uw + A, APU», 
Ow rp. APA AS AP 0 OatutM pf (1.96) 


In addition, we see that we can have tensors of mixed nature. A ten- 
sor with m contravariant indices and n covariant ones would trans- 
form under a Lorentz transformation as 


/ 


ier ae a aa Le CTO 
1 m 


seplh? 
Vy Uy 


[HA Hm 


Vien 
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We also see now that the length of a vector transforms as 


x =z,0" —+ A 


= Oryx = oye =a", (1.98) 
where we have used (1.94). Namely, the length of a four vector is a 


Lorentz scalar as we have observed earlier. 
Furthermore, let us note that since 


gh go NE ae (1.99) 


we can write 


he Pa (1.100) 


which follows from (1.94), namely, 


Jp _ V 
gS Ae. 


or, Ao = AA a” = x’ = 2, 
or, p= hare 
or eae. (1.101) 


Similarly, since 


Ly 72, =A,’ ay, (1.102) 


using (1.94), it follows that 


Laan eer a (1.103) 


We are now in a position to calculate how the four derivatives 
transform under a Lorentz transformation. For example, 
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ip ia” De 0 _ Ox, O 
a Ob, Oxi, a Oxi, Oxy 
= A a. = A* 0” (1.104) 
V Oxy V ’ 


where we have used (1.103). Similarly, using (1.100) we obtain 


seis O 0 = da" -0 
Me Ort ia Ox!#& Oa! OxY 
Vv 0 V 


Thus we see that contragradient and the cogradient vectors transform 
like contravariant and covariant vectors respectively. From these, 
it now follows that the D’Alembertian (1.75) is invariant under a 
Lorentz transformation. 

We are now in a position to discuss the Lorentz covariance of 
Maxwell’s equations. First of all we note that since the field strengths 
are tensors of second rank, under a Lorentz transformation, they 
transform as (see (1.96)) 


Pi NINE Es 
Fy > APAPF yy. (1.106) 


Thus, 


Oy. = OFF EN OF AKER, 


r / 
= MAAN? dM Fy, 


BVA? OY Fy, = AP OF \y. (1.107) 


This shows that if 


OF, =0, (1.108) 
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in one inertial frame of reference, the form of the equation does not 
change in a Lorentz transformed frame of reference. Similarly, we 
can also show that the equation for the dual (1.82) 


OF, =0, (1.109) 


also maintains its form in any inertial Lorentz frame so that Maxwell’s 
equations are Lorentz covariant. We should note here that since 


Bay => MP APR ys, (1.110) 


and the transformation matrices AA are not necessarily diagonal, 
Lorentz transformations (boosts) mix up different field components. 
This is another way of saying that under a Lorentz transformation, 
the electric and the magnetic fields mix with each other which we 
are all familiar with. 


> Example (Null basis vectors). As we have seen in (1.67), in Minkowski space 
vectors can be time-like, space-like or light-like (null). Let us ask if it is possi- 
ble to find four linearly independent null (light-like) vectors in four dimensional 
Minkowski space which are also orthogonal. 

It is quite easy to find four linearly independent null four vectors. For 
example, we can have 


ef = (1,1,0,0), eS = (1,0, 1,0), 


es = (1,0,0,1), ef = (1,-1,0,0). (1.111) 


This set is obviously linearly independent and each vector is easily checked to be 
light-like (null), namely, n,vei'e¥ = 0,i (not summed) = 1,2,3,4. Such a set 
defines a basis for the Minkowski space. In fact, this is the light-cone basis for a 
photon. Note, however, that these basis vectors are not orthogonal. 

It is not possible to find a set which is in addition orthogonal and null for 
the following reason. We will attempt to build an orthogonal basis out of a set 
of null vectors via the Graham-Schmidt procedure. To that end, take any of the 
null vector and label it vi’. vii necessarily has a time component and a space 
component (it is the only way it can be null but non-zero) so we can write 


vy =(1,vi), virvi =. (1.112) 


To get the second vector vs we know it, too, must have a time and space compo- 
nent so up to a multiplicative constant we can write 
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vf =(1,v2), ve-ve=1. (1.113) 


If we now impose orthogonality between the two vectors (1.112) and (1.113), we 
obtain 


v1-v2 =1—vi-ve =0, (1.114) 


which implies 


Vi-vo=1. (1.115) 


But, vi, v2 are (three dimensional) unit vectors so that (1.115) implies that 
vi = v2 which would amount to their not being linearly independent. This 
proves that it is impossible to find a set of four linearly independent light-like 
(null) vectors in Minkowski space which are also orthogonal. 

< 
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CHAPTER 2 


Relativistic dynamics 


2.1 Relativistic point particle 


It is clear that Newton’s equation which is covariant (form invari- 
ant) under Galilean transformations, is not covariant under Lorentz 
transformations. So if we require Lorentz invariance to be the invari- 
ance group of the physical world, we must modify Newton’s laws. Let 
us note, however, that since Newton’s laws have been tested again 
and again in the laboratory for slowly moving particles, it is only 
for highly relativistic particles (|v| ~ c = 1) that the equation may 
require any modification. 

We have already seen that in Minkowski space we can write the 
line element as 


de adr Sy ydeds=dit dx, (221) 


where drt is known as the “proper time” of the particle and is related 
to the proper length through the speed of light (ds = cdr) which 
we have set to unity. The proper time, like the proper length, is an 
invariant quantity (since the speed of light is an invariant) and hence 
can be used to characterize the trajectory of a particle. To describe a 
Lorentz covariant generalization of Newton’s equation, let us define 
a force four vector f such that 


d2zt 
dr? 


= f¥, (2:2) 
where m represents the rest mass of the particle. Clearly since the 
proper time does not change under a Lorentz transformation, equa- 


tion (2.2) takes the same form in all Lorentz frames. Furthermore, if 
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we know the form of f“, we can calculate the trajectory of the parti- 
cle. To understand the nature of this force four vector, we note that 
the form of the relativistic force four vector can be obtained from 
the observation that since (2.2) holds in all Lorentz frames, it must 
be valid in the rest frame as well. In the rest frame of the particle 


(% =0) 


dp =di, (2.3) 


and hence (2.2) takes the form 


d?a: i 
ah dt2 = Lees) 
Flreat) = 0. ae 


Furthermore, if we define the relativistic four velocity of a par- 
ticle as 


= dx 2 dt dx dt dx 


Py (Ne (ee af] 2.9 

u dr ae a! ? qe? a ,V), ( ) 

then in the rest frame of the particle we have Urrest) = (1,0,0,0). 
Therefore, in the rest frame we obtain 

Ficest) (rest) Mev = feat = 0. (2.6) 


On the other hand, this is a (Lorentz invariant) scalar equation and 
hence must be true in any inertial reference frame. Secondly, we see 
explicitly that 


Ulrest) (vest) ay = Wrest) = 1: (2.7) 


This must also remain true in all inertial reference frames as can be 
seen from (2.5) using the definition of the Lorentz factor in (1.72). 
That these two equations are compatible can be seen in the following 
way. If 
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aru Ty — l. (2.8) 


it follows that 


d p 
— (uMu" nv) = 0, 


dr 
du 
or, res Ve = 0, 
or = Pai = 0 
’ m LV ’ 
or, feu’ nu = fPup = 0. (2.9) 


Here we have used the fact that 


d2a! dul 
—_ = 
fe =m oe m a (2.10) 


Equation (2.9) shows that the relativistic force (acceleration) has to 
be orthogonal to the four velocity and, as a result, not all components 
of the force four vector are independent. 

In analogy with the non-relativistic case, we can define the four 
momentum of a particle as 


p= mu", (2.11) 


so that the dynamical equation (2.2) (or (2.10)) can also be written 
in the familiar form (the rest mass m is a constant) 


ica 


init (2.12) 


We note that the space components of the four velocity are given by 
(see (2.5)) 


dr 
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al 
where y = (1—v*) ? is the Lorentz factor introduced in (1.72) 
(remember c = 1). It follows now that for slowly moving particles 
(v=|v| <1) 


u=v+0(v*). (2.14) 
Furthermore, 
dt _i 1 
iS Shae) FSi OO (2.15) 


Thus, for slowly moving particles, we obtain from (2.11) 


p = muemy, 
— Cue 12 
p = muemt 5 mu. (2.16) 


It is clear that p® is nothing other than the (non-relativistic) energy 
of the particle (including the rest energy) and (2.11) reduces to the 
familiar energy and momentum relations of non-relativistic physics 
(in the proper limit). Thus we see that the energy and momentum 
are unified in this picture into a single four vector p” such that 


p= (Pop). (2.17) 


It follows from this that 


(mul) (mu) thw 


mut’ nu, =m’, (2.18) 


p= pp" we 


where we have used (2.8). Therefore, the energy and momentum of 
a relativistic particle satisfy the condition 


p’ = pp, = E? — p? =m’, (2.19) 


which we recognize to be the Einstein relation. 
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Once we know the components of the force in the rest frame, we 
can calculate them in any other frame. Thus, for example, denoting 
the rest frame components of the force by (see (2.6)) 


fos = F° =0, 
a = (2.20) 


in an arbitrary frame the components of the force take the form 


vie = FOE, 
(v-F) 


f = F+(y-1)v - 


(2.21) 
where v represents the instantaneous velocity of the particle (frame) 


and v = |v|. Here we have used the fact that an arbitrary four vector 
A transforms under a general Lorentz boost as 


Ale = y(A°+v-A), 
A! = A+(7- jp + 7vA°, (2:22) 
v 


Recalling that u“ = (y,7v) in an arbitrary frame (see (2.5)), we note 
that 


flu, = fou®-f-u 
= Yv- F-{w-F+ 7(7-)v-F} 
= 0, (2.23) 


as we would expect (see (2.9)). 


> Example (Transformation of force). From the Lorentz transformation matrix 
(1.61), we would like to derive that under an arbitrary Lorentz transformation 
(boost) a contravariant vector transforms as (see (2.22)) 

A'°=y(A°+v-A), 


v(v- A) 


A’=A+4+(y7-1) +yvA°, (2.24) 
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where v denotes the boost velocity, v = |v| and the Lorentz factor is given by 
(see (1.72) in units c = 1) 


y= (1- wy? (2.25) 


Furthermore, using this as well as assuming that in the rest frame the force four 
vector has the form 


Spans = 0, Sixest) = FY, (2.26) 


we would like to show that in a frame where the instantaneous velocity of the 


5 ‘ _ dx 
particle is v = |, 


‘i = v- F= v- forest), 


v-F 
f=F+(y-lv at (2.27) 


We recall (see (2.5)) that the proper velocity of a particle (in any frame) is 
defined as 


wb 
gue (2.28) 


ar 


leading to the components 


0 dx°® = dt re 
~G dr” 
dx dt dx 
= a ee 2.2 
dr drdt oe) 
where v = oe represents the instantaneous velocity of the particle and y = 


(1 — v?)~1/? denotes the Lorentz factor. It follows from (2.29) that in the rest 
frame of the particle where v = 0, we have 


Ufrest) = (1,0, 0,0). (2.30) 


Since the proper velocity is a four vector, we can relate its components in 
any frame to the rest frame through the Lorentz transformation (1.61) 


wu 


uU= AM ,UCest)» (2.31) 


and comparing (2.29) and (2.31) (with the use of (2.30)) it follows that 


Mo =y, No =w. (2.32) 
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Furthermore, from the orthogonality relation satisfied by the Lorentz transfor- 
mation (1.62) (or (1.92)) 


AM APA (2.33) 


we conclude that 


ae Sige nin 


or, A° pA? + AP jAv =1, 
or, (A°,)? = (A 9)? -1= 9? -1= oy = 7°, 


or, A°;=W', (2.34) 


where we have chosen the positive square root for simplicity (v can take positive 
as well as negative values). The other components of the transformation matrix 
can also be determined similarly. However, this is all we need to transform the 
force four vector. 

We know that 


Foca = 0, on = F’. (2.35) 


From the form of the transformation matrix, we can now determine that in any 
frame 


‘id = A° vf (rest) = A° of ae a A° if (rest) 
=0+ yv-F=w-F. (2.36) 


The spatial component of the force can be easily determined from the orthogo- 
nality relation 


Tv" fY =u f? —u-f =0, 
or, yv- F-y-f=0, 


or, v:(yF-f)=0. (2.37) 


This determines 


f= 7F +f, (2.38) 


where f; denotes a component perpendicular to the velocity v. The unique 
orthogonal vector that one can construct from two vectors F and v is given by 
(up to an overall multiplicative factor) 
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_ v(v-F) 


F 2 


(2.39) 


As aresult, we determine the unique spatial component of the force in an arbitrary 
frame, which also has the correct rest frame limit, to be 


f= 7F +f, =7F+(1 v(F ae 


v2 


v(v- F) 


=F+-)——. (2.40) 
We can write (2.36) and (2.40) together as 
f° ie Sie F= Nes frest)s 
F 
f=F+(y-l v5 ; (2.41) 
as discussed in (2.21). 
< 


> Example (Combination of velocities). Observer O, moves with velocity vp 
with respect to Os who moves with a velocity vc with respect to Oc. The 
coordinate systems are related by the Lorentz transformations 


ta A“, (ve)@p, 
ait = M (vo), (2.42) 


where A‘,(v)’s represent the respective Lorentz transformations (boosts). We 
would like to determine the velocity of Oa with respect to Oc, namely, the ad- 
dition formula if all the velocities are in the same direction, say along the x 
direction. 

We note from (2.42) that we can write 


ak = M(vp)M,(ve)at, (2.43) 
and without loss of generality we can assume that the particle is at rest in frame 
C so that 
x = (tc, 0,0, 0). (2.44) 
From (2.34) we know that the Lorentz transformation matrix between any 


two frames moving with velocity v has the form (A’;(v) can also be determined 
in a similar manner) 
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(2.45) 


Denoting the instantaneous velocity in the frame A to be va, it follows now from 
(2.42) as well as (2.45) that 


dt, dx”, 


qe = A°,. (up) AX, (vo) = A®,, (vs) A" (vc) 

= A°y (vp) A°9 (vc) + A°,(up)A‘o(ve) 
= 7(ve)7(we)(1 + ve - vo), 

i, da, 7 dag i dic 

a= ae = Malen) AY (0c) Ge = Ay (vs)A"o (ve) = 
= (A'9(vp)A°9(vc) +4 (va) Mo(ve)) 5° 
= [r(om)r(oe)e ay (5; — (y(vs) — a ) xfve)ee| =< 
~ [r(om)el +e + (y(vs) - y= | iG es) 


(2.46) 


In vector notation the velocity va can be written in terms of vp and Vc as 


VB° Vc 1 VB°Vc 1 
ie 7S a ———. _ (24 
VA ( + a ) Ve + Ge) (v- -B ve) cence (2.47) 


If the velocities are all in the same direction (say along the x-axis) then the 
second term in (2.47), which represents vc. (with respect to vs), will drop out, 
leaving the familiar expression for the addition of velocities 


Up + Uc 
Va = 


eayeienie 2.4 
ices (2.48) 


where the vector notation is no longer relevant since the problem is now one 
dimensional. We can recognize the transformation in (2.48) as being identical to 
the relation for the sum of hyperbolic tangent of angles, 


tanha + tanhb 
tanh b) = ——__—__.. 2.4 
aaah) 1+ tanhatanhb 2) 
In fact, if we parameterize the velocities in terms of angular parameters (known 
as rapidities) as 
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va =tanhOs, vg =tanhdg, vc = tanhd@c, (2.50) 


then it follows from (2.48) and (2.49) that 


tanh dp + tanh6c 
tanh 0, = —————————- = tanh (0 0. 2.51 
cates 1+ tanh 6, tanh 06 anh 4 + 8c), 2p) 


which leads to 


On => On + Oc. (2.52) 


In other words, while the addition of velocities (see (2.50)) have a nonstandard 
form, rapidities or angles representing velocities have a standard composition. 
This is analogous to the fact that while the slopes in Euclidean space have a 
nonstandard composition, the angles representing the slopes simply add. 

< 


> Example (Proper acceleration). Let us consider the relativistic motion of a 
particle along the x-direction where the particle undergoes a uniform (constant) 
linear acceleration. We would like to show that the motion in this case leads to a 
hyperbolic trajectory (hyperbolic motion) and compare this with the correspond- 
ing motion in the non-relativistic case. 

Since we have motion along just one spatial direction (a-axis), we can write 
the proper velocity as (see (2.5)) 


da" 


wT 
UU — 
dr 


y(1,v) = (7, yr), w=0,1, (2.53) 


where v denotes the instantaneous velocity of the particle 


dx 
y= 8, (2.54) 


Since the Lorentz factor (1.72) has the form 


NIF 


ya (1—oy 7}, (2.55) 


it follows that 


1470 =’. (2.56) 


Furthermore, since we have 


dy 3 dv 


7 = 48y (2.57) 


we obtain the proper acceleration to be 
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du¥ d dy dyv 

be SSE oe, a Pee | (uot ie lace 

GS gO HV a ae) 
“s 4 du 2 22 du = 4dvu 


where we have used (2.56). It follows from (2.58) that 
dv \* dv \?* 
2... vn 2 Sf ser eee pecs 
a = nva"'a” = (vu — 1)y ($7) y (7) <0. (2.59) 


In the instantaneous rest frame of the particle where 7 = 1, we can write 
this as 


Bes UN aes (2.60) 
c= vg Mee a : 


where the instantaneous acceleration is defined as 


d(yv) _ 
=e (2.61) 


So far our discussion has been quite general. We will now consider the case 
of constant acceleration (a =constant). In this case, (2.61) leads to 


d(yv) 
dt 


= a= constant, 
or, yvu= at, 

2 
or, af =v," 


or, 140°? =14 70? =7 = (1-07), 


or, vi=l : = ott? 
, a 1+a2t? 1+ a2t?’ 
at 


aay (2.62) 


or, v= 

Here we have used (2.56), set the constant of integration to zero as well as taken 

the positive square root for simplicity. This choice of the integration constant 

corresponds to choosing the initial velocity to vanish, namely, v = 0 when t = 0. 
On the other hand, the instantaneous velocity is defined as (see (2.54)) 


_ dx 


which together with (2.62) leads to 
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dx at 


dt (1 + a2#2)1/2” 


1 d(a*t?) 
or, adx = 5 (4 atti? (2.64) 
Equation (2.64) can be integrated to give (we assume x = 0 when t = 0) 
ax = (1+a7t?)'/? -1, 
or, (ax+1)? =a*a? + 2ar4+1=1+4 078, 
2 2 
or, ax +2x—at~ =0, (2.65) 
which describes a hyperbola with asymptotes 
1 
ep Sat. (2.66) 
a 


Equation (2.65) should be contrasted with the nonrelativistic parabolic motion 
of the form (with the same initial condition « = 0 = v when t = 0) 


(2.67) 


Finally, we note that we can carry out the complete analysis in terms of the 
proper time as well and write all the variables as functions of the proper time as 


v = tanhar, y= (1—v’)7'? = coshar, 
pemer ra (cosh a7 ~ 1) (2.68) 
a a 


2.2 Current and charge densities 


Let us consider the motion of n charged particles with charge q,, at 
the coordinate x,. In this case, the current density associated with 
the system is given by 


(2) = Yan PO 5 Ge — xe, (8). (2.69) 


Similarly, the charge density of the system is given by 
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plat) = Y> and? (x — xn (t)). (2.70) 
n 


If we identify x°(t) = t, then we note that we can write the charge 
density also as 


p(x) do 9nd?(x — Xn(t)) 


So dn i dt’ 5°(x — x,(t’)) 6(t — t’) 
S° dn / dt’ 64 (x — x(t’) 

dt! 
= Yn fo a 64(a — %n(T)) 


fay 7: 
= far ay 2 Hen), (2.71) 


where in the intermediate step we have changed the integration over 
t’ to that over the proper time 7. Similarly, the current density can 
be rewritten as 


(0) = oan PHL 68 (x — nl) 


=f arn PA 5H e—an(r)). (2.72) 


It is suggestive, therefore, to combine both the current and the charge 
densities into a four vector of the form 


= f oa FO oe —anln), (2.73) 
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where the time component of the current density four vector in (2.73) 
is the charge density of the system while the space components yield 
the (three dimensional) current density. The four vector nature of 
the current density is obvious from the fact that the proper time does 
not change under a Lorentz transformation and 


1 


64(x) = 5*(2), (2.74) 


where in the last step we have used the fact that we are consider- 
ing only proper Lorentz transformations with det A = 1 (see (1.68)). 
Therefore, we see that J’ has the same Lorentz transformation prop- 
erties as xz“ which is a four vector. 


Furthermore, we note that 


6) 
— JH 
aun 2) 


7 
O 
= on | ar Lo 


xh (rT 
frdn g nk ) at. 64 (x — t,(T)) 


0, J" (a) 


64 (a — £n(rT)) 


dah (r) 
dt 


Ox! 


= [Xo dents) (- mates 5 (x - 24(7))] 
fad i (-< 5 (a — en())) 
/ ary (*) Cee Cea) (2.75) 


Thus we see that the current is conserved (gq, is a constant) and, 
written out explicitly, equation (2.75) has the form of the familiar 
continuity equation, namely, 
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OS) 
or, doJ° + 0, J* = 0, 


or, 2p +V-J=0. (2.76) 


This shows that the continuity equation (2.75) or (2.76) is Lorentz 
invariant (holds in any Lorentz frame). 

Once we have a conservation law (conserved current density), 
we can define a conserved charge as 


= - Bx J°(x,t) = Q(t). (2.77) 


That this is a constant in time (conserved) can be seen as follows, 
10 fa, A(x, 1) 
ot 


di 
[es Pol) U5) = f ae 93GeD) = (2.78) 


where we have used the continuity equation (2.76) and the right 
hand side vanishes because of Gauss’ theorem if the currents vanish 
at large separations. Thus, we obtain 


dQ 

dt 

We can further show that the conserved charge Q is not only 
constant but is also a Lorentz scalar in the following way. Since 


=0. (2.79) 


Q= [ave D(a), (2.80) 


and since J° behaves like the time component of a four vector under 
a Lorentz transformation, the charge must transform as 


Q~ pas = Lorentz invariant. (2.81) 
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A more rigorous way of seeing this is as follows. Let us consider the 
integral 


fe / d4e 3, (J#(x,t)6(n-2)), (2.82) 


where n“ is a time-like unit vector and 


1 ifz>0, 
Bias 2.83 
¥) © AP e210: Ca 


represents the step function. Therefore, writing out explicitly we 
have 


T= [es [do (J°(x, t)0(n + x)) + 0; (J*(x, t)O(n-x))]. (2.84) 


The spatial divergence, of course, vanishes because of Gauss’ theorem 
so that we can write 


fe / de a (J°(x,t)0(n-2)) (2.85) 


If we now identify n“ = (1,0,0,0) and do the time integral then we 
have 


—_ / dt dx & (I(x, 1)0 (t)) 


t=oo 


‘ 3x, J°(x, t)0 (t) 


= [ee Pie $=) =~ OS) =o, (2.86) 


since Q is a constant. Thus we conclude that we can write 
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Q [ate Oy (JM (x,t) (n- x) 


zs [ae (I,J"0(n- a) + J#A,,0(n + x)) 


/ dz J*(x)3,0(n +2). (2.87) 


The Lorentz scalar nature of Q is now manifest since both d+z and 
6(n- 2) are Lorentz invariant quantities. 


2.3 Maxwell’s equations in the presence of sources 


If there are charges and currents present, then we know that Maxwell’s 
equations take the form (c = 1) 


V-E = p, 
V-B = JQ, 
OB 
E = -— 
V x a 
OE 
B= 2: 
Vx a td: (2.88) 


where p and J denote the charge density and the current density 
respectively (produced by the charged matter). We have already 
seen in (2.73) that p and J can be combined into a four vector 


J" = (p,J). (2.89) 
Furthermore, we have also seen in (1.76) and (1.77) that the electric 
field E and the magnetic field B can be combined into the field 
strength tensor F,,,, such that 

fo = Ei, 


Fj; _ —€jjr Bp. (2.90) 
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The dual field strength tensor in (1.81) leads to the components 


Fy = Bi, 


Fj; = €ijk Lk. (2.91) 

As a result, in the presence of sources we can write Maxwell’s 
equations (2.88) as 

OF, =J,, OF, =0. (2.92) 

That these give the correct Maxwell’s equations can be seen as fol- 


lows. Let v = 0 in the first of the two equations in (2.92) which leads 
to 


OFF a= Jo; 
or, OE = ?p, 
or, (-V)-(-E)=V-E=p. (2.93) 


Similarly, choosing v = 7, we obtain 


Fg = Jj, 
or, O° Fo; + OF; = Jj, 

OE; ., 
Or, a +0 (—€i;~Br) = J; 

E 
or, + (-¥) xB=-d, 
E 

or, VxB= a + J. (2.94) 


These give two of Maxwell’s equations with sources in (2.88). The 
other two equations which are unaltered in the presence of charges 
and currents (compare with (1.51)) are still given by the equation 
involving the dual field strength tensor (as in (1.82)) 
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oUF. = 0, (2.95) 


which can also be written as the Bianchi identity 


On Lo + OVP \y + OD Fay = 0. (2.96) 
(The Bianchi identity can be explicitly checked to be true from the 
definition of the field strength tensor in (1.83).) 


The Lorentz covariance of the equations can be seen from the 
fact that 


if Xr 
OF, > MF, = (AM ,0") (ADA?) Fay 
Xr / 
St ARN ONO Fi 
=. NAO" Fy 
= APO B= NO Fs: (2.97) 


Similarly, 


Jy ed yl I ps (2.98) 


Therefore, it is clear that the equation 


OF, = Jy, (2.99) 


maintains its form in all Lorentz frames. In other words, if the above 
equation is true in one frame, then in a different Lorentz frame the 
equation has the same form, namely, 


OP aay (2.100) 
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2.4 Motion of a charged particle in EM field 


If a charged particle with charge q is moving with a non-relativistic 
velocity v in an electromagnetic field, then we know that it experi- 
ences a force (Lorentz force) given by (c = 1) 


Fur =¢q(E+vxB). (2.101) 


If the particle is at rest then, of course, the force is purely electrostatic 
and is given by 


F (rest) = gE. (2.102) 


We note that we can generalize this to the relativistic electro- 
magnetic force experienced by a particle in the four vector notation 
as 


lu py dav [ey 
i Sul a qFuu’. (2.103) 


For example, this force is by construction orthogonal to the four 


velocity (as we would expect, see (2.9)) 


ft, qr Ue t= Ghai =, (2.104) 
which follows from the anti-symmetry of the field strength tensor. 
Furthermore, in the rest frame of the particle (& = 0), it is clear 
that 

da” dx’ 
0 0 _ 70 = 
Temeat) = qi, az = 0, 
F . dr’ 
Forest) = 9Py 
» dae? . dai 
_ fic ae Ai Me ee 
a( 0dr nb dr ) 
. ge 
= qF'y ae Fo = —qFio = Foi = Ci, 


or, E(rest) = qk, (2.105) 
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as we would expect. Here we have used the fact that in the rest frame 
of the particle x° = t = rT. For a non-relativistic particle (v < 1), on 
the other hand, we note from (2.103) that 


: : — 
finn) = gE'y eee 


“dt dr dt dr 


(Foi + €ijnBav’) ; 


ta dx° dt dai a 
pps 
1a 


dt 
or, finn) = Ia (E+v x B) =q(E+vxB) 
= g(E+vxB)+0(v*). (2.106) 


Furthermore, the time component of the force has the form 


0 
four) = gy ae 


= qv Ey 
= v- fim =V- forest) +0 i? hs (2.107) 


which we can also identify with 7v - firest). This, of course, also 
gives the time rate of change of the energy of the particle in an 
electromagnetic field. Thus, we see that the force four vector (2.103) 
reduces to the appropriate limits and, therefore, we conclude that 
the correct relativistic form of the equation of motion for a charged 
particle in an electromagnetic field is given by 


2 ht He He v 
d's ee eee) ey 2 dx 


— = qF"u’. Ql 
ov dr? dr dr ’ dr aus 208) 
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The non-relativistic form of the equation can be obtained in the 
appropriate limit by using the components of the forces derived in 
(2.106) and (2.107). 


2.5 Energy-momentum tensor 


Just as for a distribution of charged particles we can define a charge 
density and a current density and combine them into a four vector, 
similarly for a system of particles with nonzero momentum we can 
define the following densities, 


Lee = See (x — Xn (t)) 


l| 
Qu 
4 
3 
& 
a. 
=a 
4 
—" 
a 
KS 
8 
3 
—~ 
4 
a 
— 
bo 
em 
S 
to} 
al 


4 dz, t 
Tawa) = Sook EO 6 — al) 


l| 
a 
4 
3 

& 
a |S. 
a beat 

4 

Trina 
or) 

w 
— 

| 

* 
— 
4 
eZ 
lor 
— 
oe 

| 

8 
oS 
> 
os 


nu Sen) 54 (0 —2n(T)). (2.110) 


I 
ee 
a 


Therefore, we see that we can combine the two expressions in (2.109) 
and (2.110) to form a second rank tensor density 


Vv day 4 
Tae = f av Dow GE Oe —20(0). (2.111) 


Furthermore, if we use the definition of the momentum for a rela- 
tivistic particle (see (2.11)) 


dat 
pi =m a (25112) 
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we see that the tensor T#” in (2.111) is a symmetric second rank 
tensor density and can be written as 


= Tees : (2.113) 


This is known as the stress tensor density or the energy momentum 
tensor density of the system of particles. 
Let us next note that 


0 
py = pL 
Ojil (nathan) ~~ Ort Dacian) 


_ dah, din <4 
7 sam f ar a dr Oe Slt) 


= fer S- f%5" (@ — an(T)). (2.114) 


We note that, unlike the current density in (2.73) where the coef- 
ficients qn are time independent (see also (2.75)), here ph’s are, in 
general, time dependent. Therefore, only if the total force acting on 
the individual particles of the system is zero, then we have 
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Onl. =) (2.115) 


(matter) 


Namely, the energy momentum tensor for matter in (2.111) or (2.113) 
is conserved only in the absence of external forces and in this case 
we can define the charge associated with this tensor density as 


3 0 3 0 
Tate w= fa 7 Lee ic 7 Do ass (2.116) 


which is easily seen to be conserved (see (2.78)), namely, 


dP" 


ian a 


dt 
We note from (2.111) or (2.113) that 


(2.117) 


0 0 
aes a [ee ee ee fe oe Fe ses 
= [ete far oon dn 6* (@ — 2n(T)) 
de 
Z > [ate nest (Gane) 


a ee (2.118) 


In other words, the conserved charge in this case is the sum of the 
momenta of individual particles (total momentum of the system) and 
it is clear that under a Lorentz transformation it transforms like a 
contravariant four vector. 

On the other hand, if there are external forces acting on the 
system, we have seen in (2.114) that the divergence of the energy 
momentum tensor density (stress tensor density) of the system of 
particles is not zero and is given by 


OTe = f a 68" (e297). (2.119) 
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Let us assume, for example, that the particles carry individual charges 
Gdn and that the force they experience is due to the presence of an 
external electromagnetic field. In this case, we can write (see (2.103) ) 


They =f AD fa?! (w— alr) 


V da? 4 
far anrs ae 6° (4 — In(T)) 


Xr 
PS [ara at 54(x—a2n(t)), (2.120) 


where we have used the fact that the external field depends only on 
x and, therefore, can be taken outside the sum and the integral. The 
rest of the expression together with the integral sign is simply the 
current density which we have defined earlier in (2.73) and hence we 
obtain 


pV = Xr 
cyt a (2.121) 

To understand better what this means, let us define a second 
rank symmetric tensor from the electromagnetic field strength tensor 
(so that it is gauge invariant) in the following way 


1 
fa = cidae = — Fe RY of i nh’ FysF. (2:122) 


We can work out the explicit forms of the components of this tensor 
density associated with the electromagnetic field from its definition 
in (2.122) 


1 
700 = —F% FO ae a gee ae 


(em) 


i : és 
A (FoF + Poke + Bik) 


1 
= Foo + ri (—2F oi Foi + (—€ijnBr) (—exjeBe)) 
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= BE, -— 5 BE + 5 BiB: 
2 2 
u 2 2 
5 (E +B), 


Te 


F 1 F 
0 0 
—f ae ea a n eH 
= 07 mi 
= —-F%F"', 
= —(-Fo;)(-Fij) = —Foj ij 
—E; (—€i;nBx) = jr; Br = (E x B);. (2.123) 


This shows that the expressions for T) on) and T en 
to the correct energy and momentum (recall the Poynting vector) 
densities for the electromagnetic field as we know from electrody- 
namics. Therefore, we can identify T/* =) in (2.122) with the stress 
tensor density of the electromagnetic field 


It follows now from its definition in (2.122) that 


lead respectively 


1 
OTe) = =(aiF EY = FAO.E™ + 3 nt” (Ou EF?) Fyp 


= -(0,F4)PO — FyyotP +2 (OP) Fy, 


1 
2 


1 1 
= —(0,F4)FY ae - cae (arr” 4 a rev ) Ee 5 (a"F*) Fy, 


1 
= —(O,F4)FO +5 Fyp (ar 4+ OMFOY 4 arr’) 


—(0,F4)FO = —J) FO = FYI. (2.124) 


Here, in the intermediate steps we have used the Bianchi identity 
(see the discussion following (2.95)) 


Only as OF yy + On Fw = 0, (2.125) 
as well as Maxwell’s equations (2.92) in the presence of sources 
0,FM = J”. (2.126) 


Thus, we see from (2.121) and (2.124) that 
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» 
OT seer) = ERP = 0,7 


(matter) (em)? 


(2.127) 


which shows that if we define the total energy momentum tensor den- 
sity of the system of particles as well as the electromagnetic radiation 
field as 


Total) a Feasteh + Tas 
dz,” 
~ [ar Sne =e 64 (a — an(r)) 
1 
—FUARY + ri nt’ PPE, (2.128) 


then we have 


0,TH 


Hoialy = 0: (2.129) 


Namely, the total stress tensor density associated with the system 
including matter and radiation is conserved. (The energy momentum 
tensor density of an isolated system is divergence free because of 
Poincaré invariance.) 


2.6 Angular momentum 


Once we know that the energy momentum tensor density of an iso- 
lated system is conserved, we can construct other tensor densities 
from this, which would also be conserved. For example, given 


6,7’ =0=0,T", TH =T", (2.130) 


let us define 


MOP? Sahl aT. (2.131) 


We note that by construction this tensor density is anti-symmetric 
in the last two indices, namely, 
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MAH! = — Mrve, (2.132) 


We also recognize that it is divergence free in the first index, namely, 


0, Mv = — (reset) 
c 


2 (oe 4 HOT — KT — w"d,T") 
= T_T! 0, (2.133) 


where we have used the fact that the stress tensor is symmetric and 
conserved (see (2.130)). Furthermore, since M*” is conserved in its 
first index, we can construct the charge associated with this tensor 
density as 


j= [ee MON eee (2.134) 


We can show (in the standard manner, see (2.78)) that this tensor is 
constant in time (conserved) and transforms like an anti-symmetric 
second rank tensor under a Lorentz transformation. To see what 
such a conserved charge corresponds to, let us look at a system of 
free particles for which 


MOE ees Beg Te 


x" S| Phd? (x — Xn(t)) — a4 D7 0),6° (x — Xn(é)) 


d- (enPh — chp) 0° (x — xn(t)) 


n 


= So eign Lh6? (x — xn(t)). (2.135) 


In other words, the space components of the tensor density M0” 
correspond to the angular momentum density of the system of parti- 
cles. Hence we can identify J4” = —J”¥ as the generalized relativistic 
angular momentum of the system. 
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The angular momentum of a particle, of course, consists of two 
parts, the orbital angular momentum and the internal angular mo- 
mentum. The internal angular momentum is isolated by defining the 
Pauli-Lubanski spin four vector as 


1 
sn =5 Conn ws (2.136) 


where J’, u? denote the generalized angular momentum (2.134) and 
the four velocity (2.5) respectively. s, is a constant (time indepen- 
dent) four vector for a free particle which can be seen from 


ds, 1 , du? 
eS Je eae 
dé 2 MYX at 


= 0, (2.137) 
where we have used the fact that J’” is time independent and that 
in the absence of forces, the four velocity does not depend on time 
either. We note that in the rest frame of the particle 

u = (1,0,0,0), (2.138) 


the components of s,, take the forms (in the rest frame, there is no 
orbital angular momentum) 


1 , 
gore") = €ijn J)” = spin angular momentum, (2.139) 
ee) = 9, (2.140) 


Furthermore, although s,, may appear to have four independent com- 
ponents, it actually has three since it has to satisfy the constraint 


s. — 0; (2.141) 


in any frame of reference which follows from its definition in (2.136). 
(This is like the constraint on the relativistic force four vector derived 
in (2.9).) 
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CHAPTER 3 


Principle of general covariance 


3.1 Principle of equivalence 


Let us consider a particle moving in a static, uniform gravitational 
field as well as under the influence of other (translation invariant) 
inertial forces. The equation of motion according to Newton, in this 
case, would be given by 


d?x 
m a2 =mg +) F(x —Xn), (3.1) 


where the acceleration due to gravity g is assumed to be a space and 
time independent constant and the F,,(x — x,)’s represent inertial 
forces. As a consequence of the fact that the inertial and the gravita- 
tional masses are the same (see (1.43)), we conclude from (3.1) that 
we can transform away the gravitational force, in the present case, 
by a simple coordinate redefinition, namely, if we let 


1 
XOX 5 gt”, (3.2) 


we can write the equation of motion (3.1) as 


d?x 
mM Ta = SF a(x — Xp). (3.3) 


In other words, in this case we can always find a coordinate system 
where the effects of a static uniform gravitational field would dis- 
appear. (In this case, the new frame of reference would correspond 
to a uniformly accelerated frame.) In this sense, we can think of 
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a static uniform gravitational force as being an apparent force or a 
pseudoforce. 

Let us see physically what this example implies. Consider a 
person (observer) in an elevator (laboratory) in space (say, in a space 
ship) who performs the following experiment. He releases various 
masses and finds that they all fall down with the same acceleration. 
He wishes to find out the cause of the acceleration except that he 
is not allowed to look outside the elevator. Clearly from all the 
experiments that he can perform inside the elevator, he can conclude 
that there exist only two possibilities for the source of acceleration: 


1. There is a large gravitational mass attached to the bottom of 
the elevator which pulls all particles downwards. 


2. The other possibility is, of course, that the elevator is being 
accelerated upwards uniformly so that all particles appear to 
be falling downwards. 


However, he cannot differentiate between the two possibilities. This 
shows that the acceleration due to a static uniform gravitational force 
is equivalent to the uniform acceleration due to an inertial force. 
Therefore, by a suitable change of the coordinate system we can 
transform away the effects of gravitation and the equations of motion 
would take the same form as in an inertial reference frame without 
gravitation. 

There are two interesting issues that arise at this point. If the 
above analogy is correct we can do the following experiment. Let 
one side of the elevator be fitted with a source that emits a pulse 
of light. Let the opposite side of the elevator be fixed with a de- 
tector which receives the pulse. If the elevator is being accelerated 
upwards we can calculate and show that the light ray would define 
a parabolic trajectory as shown in Fig. 3.1. That is, its trajectory 
would bend. However, if this is equivalent to the presence of a grav- 
itational mass, then we would conclude that a light ray would bend 
in the presence of a large gravitational mass. This was, of course, 
a phenomenon unheard of in classical physics. However, Einstein 
was so strongly convinced of the equivalence principle that. he made 
the daring prediction that light rays would bend in the presence of 
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large gravitational masses. This was later verified experimentally by 
Eddington. (We would do this calculation later in the course of the 
lectures. ) 


Figure 3.1: Bending of light in a space ship. 


The second question we need to ask is what happens to this 
equivalence in the presence of a non-static and nonuniform gravita- 
tional field. Pulsars and other astrophysical objects provide physical 
examples of non-static gravitational sources. Our own earth provides 
an example of a nonuniform gravitational field since all particles are 
attracted radially towards the center of the earth. It is not possible 
to balance such nonuniform accelerations by any linear inertial ac- 
celeration as shown in Fig. 3.2. Therefore, in such a case the two 
situations would be distinguishable. In the background of a non- 
static and nonuniform gravitational force particles would feel tidal 
forces which would lead their trajectories to deviate from the vertical 
straight line motion. These geodesic deviations, of course, would al- 
low one to detect the presence of gravitation. However, if we restrict 
ourselves to very small regions of space (and time) then the gravita- 
tional field would be uniform and in such regions it would again be 
impossible to detect the presence of gravitation. 

These observations led to the formulation of the principle of 
equivalence. It says that at every space-time point in an arbitrary 
gravitational field, it is always possible to choose a locally inertial 
frame such that within a sufficiently small region around the point 
in question, the laws of nature take the same form as in inertial 
Cartesian coordinate systems in the absence of gravitation. 

Some comments are in order here. First of all, note that this is 
what is usually referred to as the strong principle of equivalence. If 
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Figure 3.2: The effect of gravitation is represented on the left diagram 
while the right denotes linear acceleration. 


the phrase “laws of nature” is replaced by “laws of motion of freely 
falling (moving) particles”, then it is referred to as the weak principle 
of equivalence. Furthermore, we are yet to specify what we mean by 
the form of the laws of nature. If we choose the laws of nature as those 
invariant under Lorentz transformations then we are led to what is 
known as the Einstein’s theory of gravity or the general theory of 
relativity. However, if we choose the laws of nature in the Cartesian 
frame to satisfy Galilean invariance, we obtain the Newton-Cartan 
theory of gravity. We note that these theories have essentially very 
different characters. In the case of Lorentz invariance we can define 
a four dimensional invariant length (c = 1) 


ds? = dr? = dt? — dx’, (3.4) 


so that such theories admit a nontrivial metric and lead to met- 
ric (Riemannian) theories of gravitation. However, in the case of 
Galilean invariance we cannot define a four dimensional invariant 
length. Theories with Galilean invariance, therefore, do not admit 
a metric and hence lead to non-metric (non-Riemannian) theories 
of gravitation. We will concern ourselves only with laws of nature 
that remain invariant under a Lorentz transformation of the inertial 
frames. 

The principle of equivalence further emphasizes that gravita- 
tion is a kinematic effect. One of the interpretations of this is that a 
gravitational field would not produce any new phenomenon that we 
cannot observe in a Lorentz frame. However, the physical interpre- 
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tations of the equivalence principle are still open to discussions. 

Furthermore, let us also note the similarity between the equiv- 
alence principle and the observations of Gauss. As we have already 
discussed, Gauss had observed that even in a curved manifold we can 
establish a locally Euclidean coordinate system (see the discussion 
around (1.8)). Principle of equivalence, on the other hand, tells us 
that even in the presence of a gravitational field we can find a locally 
inertial Cartesian coordinate system. Correspondingly, we can think 
of a gravitational field as producing curvature in the space-time man- 
ifold and the effect of gravitation on a particle can then be simply 
thought of as being equivalent to a particle moving in a curved geom- 
etry. That this is plausible can be seen with the following examples. 

Let us consider a sheet of rubber attached to some boundary so 
that it is flat. If we now roll a steel ball on to the sheet, it is clear 
that it would create some dip thereby producing a curvature in the 
manifold of the rubber sheet. If we roll another steel ball onto the 
rubber sheet it would again make a dip in the rubber sheet and it 
would move towards the other steel ball giving the impression that 
the two balls are attracted towards each other as indicated in Fig. 
3.3. This shows that it is plausible to think of the effect of grav- 
itation entirely as producing curvature in the space-time manifold. 
Gravitational attraction between masses can then be thought of as 
the motion of a particle in the curved manifold which is produced by 
the distribution of all other masses. 


ss a 
( 


Figure 3.3: Masses giving rise to curvature in a rubber sheet. 


A second example is the movement of bugs on the surface of 
a spherical apple. Let us allow two bugs at the equator to crawl 
towards the north pole along straight paths as shown in Figure. 3.4. 
Over a very small region (locally), they would, of course, appear 
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to be moving parallel to each other. However, over a larger region 
the paths would appear converging. This is what is known as the 
geodesic deviation and we have noted this in connection with the 
experiment in the elevator in Fig. 3.2. Furthermore, note that. if 
the apple has a stem, then the shape of the apple is not completely 
spherical. Therefore, when the bugs crawl near the dimple caused 
by the stem, there would be a large deflection in their paths. This 
is similar to the deflection caused in the path of a comet by the sun. 
It would appear as if the bugs are attracted by the stem and that is 
the cause of the deflection in their paths. 


Figure 3.4: Motion on the surface of an apple. 


These examples illustrate that in Einstein’s view space-time is 
curved due to a distribution of matter (masses) in the universe and 
on this curved manifold freely moving particles follow the straightest 
possible paths also known as geodesics. Locally the paths look like 
the paths in a flat manifold. However, over a finite region it is the 
geodesic deviation which exhibits the curved nature of the manifold. 
The way in which the dimple around the stem gives the impression 
of attraction corresponds to the fact that massive bodies modify the 
space-time curvature in their vicinity and this modification affects 
the geodesics in such a way as to give the impression that the free 
particles are being acted on by a force whereas in reality they are 
simply following the straightest paths in the curved space-time. 


3.2 Principle of general covariance 


In the case of Lorentz transformations we have observed that the 
length interval (c = 1) 
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ds? = dr? = nywdatda”, (3.5) 


remains invariant under a Lorentz transformation. We also know 
that a Lorentz frame is a special Cartesian coordinate system. In 
general for a curved manifold with a metric, the length interval is 
defined as (recall (1.9)) 


ds* = dr? = gy,(x)da"dz”. (3.6) 


Furthermore, we can use different coordinate systems to describe 
events but the essential point is that the length interval between any 
two infinitesimally close events remains invariant under a general 
coordinate transformation (change of coordinates). The principle 
of general covariance further says that all physical equations are be 
covariant under a general coordinate transformation. That is, they 
retain their form under a general change of coordinates. 

Let us examine the consequences of a general coordinate trans- 
formation on tensors in some detail. Let us define an arbitrary coor- 
dinate transformation of the form 


gh —> g! 4 = g! (gp), (3.7) 
so that 
Ip 
da > da’# = of ders (3.8) 
Ox” 


Any quantity V“(«) which transforms like the coordinate differential 
da” is called a contravariant vector. We note that in a curved man- 
ifold, coordinates x are merely labels (and are not vectors unlike 
the flat Minkowski space) and, therefore, their transformation sim- 
ply defines four functions. It is worth emphasizing here that, unlike 
flat space-time, here there is no concept of a coordinate x,, (since 
x is not a vector, we cannot lower its index). It is the coordinate 
differentials that behave like vectors. That is, under an arbitrary 
coordinate transformation 


ah —y g!# — go! (a), (3.9) 
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a contravariant vector V“(x) transforms as 


Ox! # 
V4 (2) 3 VE (a") = ce (3.10) 
Ox” 
Furthermore, under such a coordinate transformation 
a) ; 0 Ox” 


The transformation matrix in (3.11) is, in fact, the inverse of the 
transformation matrix in (3.10) and a vector A,,(2) which transforms 
like the gradient in (3.11) is called a covariant vector. Namely, under 
a general coordinate transformation, a covariant vector transforms 
as 


A,(x) > Ai) = Byrn A, (x). (3.12) 
The transformation 
gh — g! 4 = g!H(g), (3.13) 


implies an inverse relation of the form (the coordinate transformation 
is assumed to be invertible) 


=e (e'), (3.14) 


from which we can calculate the transformation matrix a in (3.12). 
Note that unlike Lorentz transformations, these coordinate transfor- 
mations do not have to be linear. 

Once we know how covariant and contravariant vectors trans- 
form under a coordinate change we can look for many other inter- 
esting consequences. First of all, we note that under a general coor- 
dinate change the scalar product of a covariant and a contravariant 


vector would transform as 
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A,,(x) BY (x) 


Ae Be) 

Ox” Ont = % 

Oa! # A(x) Ox Be) 

On? OL? x 

Oat Ord A,(x)B*(z) 

¥ A,(2)B*(2) = A(x) B*(2). (3.15) 


In other words, the scalar product of two vectors and, therefore, the 
length of a vector behaves like a scalar (invariant) under an arbitrary 
change of coordinates. Furthermore, since the scalar product of two 
vectors can also be written in terms of the metric tensor, we conclude 


that 


A, (@) BY (x) 


Ox! Og'¥ 
/ y Xr 
gla!) ZS AMa) SE BCA) 
Ox! Ox!” 
/ / Xr p 
gala!) a AN) BPA) 
9rp(x) A* (x) BP (a). (3.16) 


Therefore, this determines 


/ 


or, Iu (2") — Oxh Ox 9p(2). 


Ox! Oal¥ 
Gi) Oa» ED = Irp(2), 


nN Pp 
Ox* Ox (3.17) 


Note that we can write the scalar product of two vectors also as 
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A, (1) BY (a) gl (a) Ay(x)B, (x) 


Therefore, we obtain 


Ox® Ox? 
Ox'# Oxl¥ 


OR hog oy. 


g(a") =9"(x), 


or, g/#¥ (a’) 


(3.18) 


(3.19) 


so that the covariant and the contravariant metric tensors trans- 
form as second rank covariant and contravariant tensors respectively. 


They maintain their symmetry properties 


under a coordinate trans- 


formation. However, note that unlike the case of Lorentz transfor- 
mations, here the metric tensors change under a coordinate transfor- 


mation (they are not invariant tensors). 
Note also that the Kronecker delta 


defined as the product of 


the contravariant and the covariant metric tensor transforms as 


ob = g!\(z) gn, (a) 
> gf (2')gh, (2) 
Ox! Oa! Ox™ Ox? 
= get ) pues 
OxP Ox? Ox! Ox!l¥ 
- 308 Og'* On” az? Po () 
~ One Oat Ox Ogu i 
Ox! # Ox 
= axe ; Ox!” 9° (2) 9r6 


9r5(X) 
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Ox!" Ox 


~ Oa Aal¥ g?? (x) G9a5(2) 


Ox! Ax® 5 OE Orr 
—— aes ro vm" _— ae 
Oxe Oxl”Y 9 Axe AalY ope v2) 


This shows that the Kronecker delta which is a mixed second rank 
tensor is really an invariant tensor under an arbitrary change of the 
coordinate system. 

It is clear now that, under a general coordinate transformation, 
a contravariant tensor of rank n would transform as 


fae Ca ae hal elias a oa) 


Oa! #1 Oa! H2 Oar! in - 
— Be Bee erate an ee Yn (x); (3.21) 


Similarly a covariant tensor of rank n would transform as 


Diep tae) TE ein a) 


Vy V2 Un 
x x x 


7 Ox! HA Oa! b2 _ Ox! len Tivy--vm (2); (3.22) 


while for a mixed tensor, we have 


Tim (a) > Titim (a!) 


VY1...Un Y4...Un 
Ox! Ht Og! H2 Ox hm Og?! See am 
Ox = Ox2 Oxrm Ox — Ol Un pa--pn(@): 


(3.23) 


We note that a tensor retains its symmetry properties under a general 
coordinate transformation. Furthermore, it is worth recognizing here 
that the transformation properties in (3.21)-(3.23) have the same 
forms as we have seen in Minkowski space, (1.88), (1.96), (1.97), if 
we identify 


A,°(«) = (3.24) 
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and allow the transformation matrices to be coordinate dependent. 
Furthermore, it is important to recognize that the transformation 
matrices in (3.24) should not be thought of as tensors in a curved 
manifold. 


3.3 Tensor densities 


In addition to tensors, in a curved manifold we also come across ten- 
sor densities. The most familiar example of a density is, of course, 
the volume element (of the manifold). We note that in flat space, 
the volume element remains invariant under a proper Lorentz trans- 
formation, namely, 


/ 
Ox # 


d‘x > dt ‘= Ony 


d‘ax = (det A“,) d*a = d*a, (3.25) 


where 3 is the Jacobian of the transformation and we have used 


the fact that, for a proper Lorentz transformation, (see (1.68)) 


det AM, = 1. (3.26) 


(In fact, we have already used the invariance of the volume element in 
(2.87) to show that the conserved charge is a Lorentz scalar.) How- 
ever, under a general coordinate transformation in a curved manifold 


Ox! Ox! # 
d‘z + d‘z’ = |——| d*z = | det dtr 3.27 
e Ox Ox” ; oe 
where the Jacobian is no longer unity and we see that the volume 
element in a curved manifold behaves like a scalar except for this 
determinant factor. We call such a quantity a scalar density of weight 
+1 where the weight is determined from the power of the determinant 


of the transformation matrix oat that arises under a coordinate 
transformation. It is, of course, important to know that the four 
volume is a scalar density because to form a scalar action (which is 
invariant under a general coordinate transformation) for a dynamical 
system 
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S= [ate ie (3.28) 


we then need to choose a Lagrangian density £ for the dynamical 
system such that it transforms like a scalar density of weight —1 
under a general coordinate transformation. 

Another scalar density that frequently arises in the study of 
gravitation (metric spaces) is the determinant of the covariant metric 
tensor 


ee (3.29) 


Since under a coordinate transformation (see (3.17)) 


Ox*® Ox? 
Gla) =? Tur (2') = Oak DalY Dplh); (3.30) 
it follows that 
grog = det Git) 
Ox Ox? 
= det PTD det agi det 9p 
Oz |? 
= lanl 9 
Ax! |~? 
— : 1 
ac | 9 (3.31) 


Therefore, the determinant of the covariant metric g,, transforms as 
a scalar density of weight —2. The last step in (3.31) follows from 
the observation (through the use of chain rule) that 


OE GE" SON 5 +05 
Ox! Onl Oalv — ov» (3:82) 


and, therefore, 
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Oe | hee =: 
Ox| |Ox'| 7” 
Ox Ox’ |~! 
Or, Bal = Die (3.33) 


Since g’” and g,, transform inversely under a general coordinate 
transformation, we have (see (3.19)) 


det g”’(x) — detg’#’(z’) 


Ox! Ox!” v 


1\2 
oe. det g*?(z), (3.34) 


det 


so that det g’” (x) transforms like a scalar density of weight +2. This 
tells us that we can obtain a scalar action from a scalar Lagrangian 
density (with weight zero) as 


S= [ue HG Ls (3.35) 


In other words \/—g d‘z defines the invariant volume element in a 
curved manifold ((—g) is positive in any even dimension with our 
choice of signatures for the metric (1.55)). 

These are, of course, all scalar densities. But there is also a 
tensor density that is quite useful in curved manifolds. It is the Levi- 
Civita tensor density e“”? (in four dimensions), which is completely 
anti-symmetric and has constant components in any reference frame 
(this is what we normally know as the Levi-Civita tensor in flat 
space-time) 


+1. if uvAp are even permutations of 0,1,2,3, 
vA? — 2 _1 if pvAp are odd permutations, 


0 if any two indices are the same. 
(3.36) 
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We note that if it were a true tensor, under a change of coordinates, 
it would transform as 


Ox! Ox!” Oax!® Ox!? 
Ox® Ore Ox” Oxf 


HYP _y ef HUD _ gerne, (3.37) 


But by definition of a determinant the right hand side is nothing 


other than €””? multiplied by the Jacobian of the transformation. 
Namely, 


Ox! Ox!” Oax!® Ox!? 


bre Kong 
Ox® Ore OxD Oxf 
ra) "kK Ox! 
= ([ — ehvAp — ae ghar, (3.38) 


Consequently, we see that if ¢“”*? were a true tensor then it would 
transform as in (3.38) and, in such a case, it cannot be constant 
in any arbitrary frame (it will have different coordinate dependent 
values for its components in different frames). In other words, we 
can no longer say that in the new frame 


+1 for even permutations, 
cvAP — 2 _1 for odd permutations, 
0 if any two indices are the same. 


For this to be true, e””*? must transform under a coordinate change 
as 


cp, Ox Ox!" Oa!” Oa!» Ax!P rp 
Ox'| Ox® Ox Ox" Oxk 
0. Ox! 
= | |S] reso, (3.39) 


In other words the Levi-Civita tensor density must transform as a 
tensor density of weight —1. Similarly the covariant Levi-Civita ten- 
sor density must transform as a tensor density of weight +1, namely, 


80 3 PRINCIPLE OF GENERAL COVARIANCE 


Ox'| Ox" Ax? Ax" Oak 
EuvrAp —? Or! Ox’ Ox” Oxl® Ogle “RoME 
Ox'| | Ox 
— Dr Oa! EuvrAp — EuvrAp- (3.40) 


It is clear now that given any tensor density one can form a true 
tensor in the following way. Let t“1:-#" be a tensor density of rank 
n and weight w so that under a coordinate transformation 


ax! 
Ox 


© Oa! Ox! Hn 


[1 ..--Ln 
t —> aad SP Bene 


pwn, (3.41) 


Let us now define the quantity 


TH Mn (/—g)° tht Be, (3.42) 


Then clearly under a coordinate change 


dx! |~" "Ba! |” Bal Oa! Hn 
TH nr =y peas | PEaaee iss Y4...Un 
( Ox i Ox) Ox"1 Oxvn 
Ox! #1 Ox! Hn ae 
— apr ive Aas (/—9) t¥1 Vy 
Oa! #1 Oa a. 8 
= So See (3.43) 


In other words, TT!” defined in (3.42) would behave like a true 
tensor of rank n under a change of coordinates. 

We now see that we can define the following tensors from the 
Levi-Civita tensor densities 


ciuAp (/=9)° cHY AP 
Ewrp = V—9J Eprps (3.44) 


which will transform as true fourth rank anti-symmetric tensors un- 
der a general coordinate transformation, namely, 
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Ox! Ax!” Aax!® Ax! 


pV Ap aa skong 

. dx O27 Ox" Oxt ° , 

= Ox® Ox Ox" Oas _ 

“wre > Bain Oalv Bath Oath Cwone: 2) 


However, it is clear that, unlike the Levi-Civita tensor densities, the 
tensors in (3.44) would not have the form (3.36) in arbitrary coordi- 
nate frames. Let us note from the transformation properties of the 
Levi-Civita tensor densities, (3.39) and (3.40), that they can also be 
used to form a scalar action out of a Lagrangian density involving 
the anti-symmetric contravariant Levi-Civita tensor density (such as 
in the case of spin 3 fields). Namely, this provides an alternative to 
forming a scalar Lagrangian density with \/—g as in (3.35). 
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CHAPTER 4 
Affine connection and covariant 
derivative 


4.1 Parallel transport of a vector 


In a curved manifold various geometric concepts are inherently more 
involved than in a flat manifold. For example, let us discuss the 
idea of parallel transport of vectors in a curved manifold. We need 
to parallel transport a vector in order to define the derivative (of 
the vector) which transforms covariantly under a general coordinate 
transformation. We note that the derivative of a vector is conven- 
tionally defined as 


TE a a a (4.1) 
e—0 € 

where € represents an infinitesimal (scalar) parameter while n” de- 
notes a unit vector along the fixed direction x” (along which the 
derivative is taken). We see from this definition that the derivative 
in (4.1) involves comparing the vector at two different (neighboring) 
coordinates and, consequently, the difference is not a vector in gen- 
eral. To compare the vector at two coordinates and thereby define 
a derivative with covariant properties, we have to parallel transport 
A(x) to the point x + en such that the difference of the two will be 
a vector. 

Let us define the problem of parallel transport with a simple 
example. Let us consider the surface of a sphere and take a vector at 
the north pole pointing south along a given longitude. Let us move 
it parallel to itself along the longitude. The naive idea of parallel 
transport is, of course, to take the vector and put its tip on the 
arrow so that its direction is always along the south and length the 
same. When we reach the equator we move along the equator by an 
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angle 6 keeping the vector pointing south all along and then move 
back up to the north pole along a different longitude. In this entire 
operation we keep the vector pointing south throughout. However, 
we note that when we reach the starting point, the transported vector 
makes an angle @ with the starting vector. That is, we note that if 
we follow our naive idea of parallel transport, then when we move 
around a closed curve the initial and the final vectors are no longer 
parallel. This is because the surface of a sphere is a curved manifold 
and the parallel transport of a vector leads to new features. 


é 


Figure 4.1: Parallel transport of a vector on the surface of a sphere. 


Of course, the naive notion of parallel transport has to be mod- 
ified even in the absence of curvature if we are using curvilinear 
coordinates. We illustrate this with a simple example using polar 
coordinates in the flat two dimensional space. First, let us look at 
two vectors A and B at equidistant points from the origin in two 
dimensions as shown in Fig. 4.2. If we use Cartesian coordinates, 
then we can write 


A=(A,42), B= (Bi, Ba). (4.2) 


We say that B is the parallel transport of the vector A if the com- 
ponents are equal, namely, if 


Ai = Bi, Ag = Bo, (4.3) 
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Figure 4.2: Parallel transport of a vector on the plane to an equidis- 
tant point from the origin in Cartesian coordinates. 


B represents the parallel transport of the vector A and it follows from 
(4.3) that |A| = |B] (under parallel transport the length remains 
invariant). 

Let us now look at the same system in the polar coordinates 
shown in Fig. 4.3. Then, clearly the components of the vector and 
its parallel transport in polar coordinates are given by 


A = (A’,A®) = (Acos€, 4 siné), 
B = (B",B’) = (Bcos(é — do), = sin(é — d@)) 


~ (B(cos€ + dd sin £), = (sing — d6cosé)) (4.4) 
where r denotes the radial coordinate of the two vectors (they are 
equidistant from the origin). Here we have used the fact that in 
polar coordinates, the angular component is divided by the radial 
coordinate (recall that the angular component of the metric in polar 
coordinates is r?) and have identified A = |A|, B = |B]. We see that 
the components of the two vectors A and B are no longer identical 
in polar coordinates (even if we assume A = B) even though B is the 
parallel transport of A. That is, a vector being the parallel transport 
of another does not necessarily imply that the components of the two 
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Figure 4.3: Parallel transport of a vector on the plane to an equidis- 
tant point in polar coordinates. 


vectors have to be equal in a general curvilinear coordinate system. 
In fact, even the coordinate axes change as we move from point to 
point on the manifold. 

That having equal components does not lead to parallel trans- 
port in a curved manifold (or in curvilinear coordinates) can also be 
seen in the following way. Let A“(x) be a vector at the point z and 
let us assume that A“(x’) defines the vector at 2’“ such that 


AM (x) = Al(c"). (4.5) 


Namely, let us assume that the components of the vectors are equal 
at the two points so that we may naively conclude that A(z’) is 
the vector A(x) parallel transported to the point «’“. However, if 
under parallel transport the lengths of the two vectors are the same, 
namely, 


Suv (@).AM(a)AY (x) = guy(a') A" (x!) A’ (x'), (4.6) 


we conclude that this relation can be true only if 


Gup(z) = Gale): (4.7) 
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Namely, only if the manifold is flat or only if the metric does not 
change with the coordinates of the manifold will the equality of com- 
ponents of two vectors imply the lengths of the two vectors to be the 
same. In general, on a curved manifold (or in curvilinear coordinates) 


u(x) # Gu (2'), (4.8) 


and, therefore, parallel transport of vectors would imply 


A¥ (x) # AM(e'). (4.9) 


A constant vector is also against the spirit of general covariance. 
For example, let us assume that in a coordinate system % the vector 
A" (z) has constant components. Then in a new coordinate system 
x" we have 


a A (ey: (4.10) 


and clearly this cannot have constant components in all coordinate 
systems since the transformation matrix is in general coordinate de- 
pendent. (This is like the example of the vector in polar coordinates 
that we have discussed in (4.4).) Let us, therefore, find out how a 
constant vector in one coordinate system Z%” changes in a different 
coordinate system x” as we move from one point to a neighboring 
point on a curve parameterized by an invariant parameter 7 (proper 
time). In this case, we can represent the vector along the trajectory 
x'(7) as 


A* (a) = A*(a(7)), (4.11) 


from which it follows that 
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= OP xl! dz? D(z) 
— opoge dr 
OP ah .Ox? dat dE. 3 
= “Spe Oe ae oe 


Pat ore arr ‘ 
ee Oa7 7) ar A) 


3 es ae A” (x(T)), (4.12) 


where we have used (4.10) and have defined 


—— 
OV 


2 ant =p » 
ae OF =) (4.13) 


OP Or Ox? Ax” 


Although T'S, is by definition symmetric in o and vy, let us ignore 
its actual form derived above and write the differential increment 
of the vector A“(x(r)) as we move along a curve to a neighboring 
point as 


dAY = TH da? A”, (4.14) 


where we assume that ['4, is a matrix of arbitrary form to be deter- 
mined. This is known as the affine connection of the manifold and 
is clearly needed to compare vectors at neighboring points along a 
curve if we are in a curved manifold or if we are using curvilinear 
coordinates. 

What the above relation tells us is that if we have a vector 
A(x) at the point x, then its value when parallel transported to a 
neighboring point x + dz is given by 


A*H(atda) = A(x) +dA¥(x) 


= A*(c)+T4,(a)dx? A’ (x). (4.15) 


4.1 PARALLEL TRANSPORT OF A VECTOR 89 


Furthermore, if we want this to be a tensor relation valid in all co- 
ordinate frames then we expect 


A*'#(g! + da’) = Al*(a') + TE (a')da’? A'” (2’). (4.16) 


If the parallel displacement (transport) does not change the vector 
nature of A“, then we have 


Ox! dr 
A¥(a!) =~ aXe), 
Ox! # 
Atl (a! N= Am ; 4.17 
(x + da’) Aa Late (x + da) (4.17) 
Using these, eq, (4.16) leads to 
Oa! # Oa! # 
Aw = AY TH (2! lo Atv (a! 
TF, Ae Hea) = FA a) + Tada! A (0, 
(4.18) 


which can be written out explicitly as (see (4.15)) 


on? 
A(x) + TA, (x)dx* A" (x) 
Ox* Wee i 
a 
a or —AMa) + D(a! da! A! 2"). (4.19) 


Furthermore, we can expand 


Ox! # Oa! # 02 y' 


Ox Ox © Axndax ae 420) 


a+da 


and keeping up to terms only linear in the increment dx we obtain 
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Ox! O27! 


ST (AXa) + PA, ada" A(0)) + 2 da Aa) 
= SE aXe) + PY (e" da! A (2!) 
Ox! ¥ O7 x! 
TL (pl lo Alu (at) — » K An nN Ar 
or, (eda 7A (a) aa Ty ,dz* A(x) aagDy dx” A* (x) 
Ox! On” «4 OL) oo 
~ Agr rn() Ox! oS Oxl¥ a) 
Of SOG? 4808 ary 
Axx Oale "Oly (x), 
or,  (2')da’? A’” (2’) 
Ont Oe Opts alt Ox" Oa i 
— ey ——— a Atv 
ao On? Og! nn) dxda Ox!l™ oe) on ey 
(4.21) 
This determines 
Oz! Ax* Oa" O?2/h Ox" = Ox 
Ii (at) — » 
Pov (@) Ox Ox'? Oxl¥ nn (@) OxOx» Bal? AxlY’ ka22) 


Thus, we see that even if we may not know the exact form of the 
affine connection, the requirement of general covariance determines 
the transformation properties of the connection under a coordinate 
transformation. Furthermore, we note that the affine connection does 
not behave like a true tensor. It is the second (inhomogeneous) term 
in (4.22) which spoils the tensor nature of the connection. 

With these conditions, we see that parallel transport of a vector 
is defined through the relation (namely, how the components of a 
vector change under an infinitesimal parallel transport) 


dA = TH da? A”, (4.23) 


where the affine connection has the transformation property under a 
coordinate transformation as derived in (4.22). The transformation 
(4.23) (namely, the change under a parallel transport) is also known 
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as an affine transformation. We note here that affine transformations 
are quite common in physics although we may not always recognize 
them. For example, we note that in the case of the two dimensional 


example involving polar coordinates in (4.4), we can write (|B| = 
|A| = A) 


A” 


Acosé, A’ = s sin €, 

dA” = Ad@sin€ = d0Asiné = rdoA’, 

dA? = a do est -* d@Acce= -* doa". (4.24) 
Consequently, we conclude that this corresponds to an affine trans- 
formation with the identification (the components of the connection 


can actually be calculated explicitly once we know its definition in 
terms of the metric which we will discuss in the next section) 


ry or = Trg = 0, 

00 = 7; 

na = To, 

[, = 1=-2, (4.25) 


so that we can express (4.24) as 


dA’ = Tidal A®, (4.26) 


where i,j,k = r,@ and dx” = dr = 0(the vectors are equidistant), 
while dx? = dé. 

Some comments are in order here. First of all we note that 
manifolds where parallel transport of vectors is defined are called 
affine spaces. We simply need the notion of a connection to define 
parallel transport. A metric is not necessary for this in general. A 
space where a metric is also defined is called a metric space. Fur- 
thermore, although the connection does not transform like a tensor, 
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we note that under a coordinate redefinition, the difference between 
two affine connections transforms like a genuine tensor (since the 
inhomogeneous term in (4.22) cancels). Namely, 


T4, (2) — TY (@) 3 Pe (e') -TS @) 


1 OE POE OE" os Oa! Oa® Ax 
Oar Ox! ® Oal¥ "1" Ark Ax Axl? Ox!” 
Ox! Ax* Ox" = O?2/4 Oa" Ax" 


Oa Oal® Axl” — OxkAx" Aa!® Ox!” 


OR On” “Ont: pas =A 
Oar Axl? Oa!¥ (TA,(@) 7 Fan(2)) : ee 


Although initially within the context in which the connection was 
introduced, it was symmetric in its lower indices (i.e., in the study of 
gauge symmetries etc.) both Weyl and Einstein pointed out that for 
the purposes of defining parallel transport of a vector, no symmetry 
property of the affine connection is necessary. In other words, one 
should allow for the fact that the affine connection can be asymmet- 
ric, namely, 


A 1 
ia = 9 (ley + re) c 9 (ee ~ ie) ’ (4.28) 


where the anti-symmetric part is not necessarily zero. In this case 
the anti-symmetric part of the connection leads to what is known 
as the torsion tensor. However, for the purpose of discussions in 
classical gravity, we will assume that the connection is symmetric in 
its lower indices. Let us, however, point out the following two useful 
general theorems. 


If there exists a coordinate system in which the connection is 
asymmetric, 1.€., ie x Pee 


1. then the connection remains asymmetric in any other coordi- 
nate frame. 


2. then it is impossible to find a coordinate system in which all 
components of the connection vanish at a given point. 
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The proof of the first theorem can be given in the following way. 
Let us assume that there exists a coordinate system in which the 
connection is symmetric so that 


Vo.(#) = Tr. (#). (4.29) 


We know that under a coordinate change the connection transforms 
as (see (4.22)) 


0g 08 ORT. 2% je Og! Ox" Ox" 
Oar Oal® Oaly — *N Ox*Ox" Ox'? Ox'¥?’ 


TY! (a) (4.30) 
so that the transformed connection would again be symmetric and 
this symmetry property would be true in all coordinate frames. In 
other words, the connection cannot develop an anti-symmetric part 
through a coordinate change. Conversely, it follows that if the con- 
nection is asymmetric (has an anti-symmetric part) in one coordinate 
frame it must be asymmetric in all frames. (We cannot get rid off the 
anti-symmetric part of the connection through a coordinate transfor- 
mation.) 

The proof of the second theorem follows from the fact that if 
all the components of the connection are zero at a gven point in one 
coordinate frame, then under a coordinate transformation it would 
have the form (see (4.22) ) 


O272'& Ax*® Ox 


Popa!) = Ox* Ox" Ox!® Ox!” 


= Pas (4.31) 
Namely, the connection would have a symmetric form in all coor- 
dinates. (It cannot have an anti-symmetric part in any coordinate 
frame.) Conversely, if the connection is asymmetric (has an anti- 
symmetric part), all of its components cannot vanish at a given point 
in any coordinate frame since the anti-symmetric part cannot be set 
to zero through a coordinate transformation. Let us recall that the 
principle of equivalence says that locally we can always find a flat 
Cartesian coordinate system. This requires that all the coefficients 
of the connection must vanish in that frame at that point which, 
therefore, restricts the form of the connection to be symmetric in an 
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arbitrary frame in light of these theorems. This fact is expressed as 
the following theorem. 


The necessary and sufficient condition for the existence of a local 
coordinate system in which the components of a vector are not altered 
by an infinitesimal affine transformation is that the components of 
the affine connection be symmetric in the lower indices. 


To prove the necessary condition, let us assume that there exists 
a coordinate system where the vector components are unaltered by 
parallel transport, namely, 


dA“ =T4,da7 A” =0. (4.32) 


Since both dx? and A” are arbitrary, this implies that in this coor- 
dinate system 


TH (x) =0. (4.33) 


In any other coordinate system, therefore, the connection would be 


O22! Ox Ax" 
a Sy), (4.34) 
Of On" Ox? Onl is 
which is symmetric in the lower indices. 

To prove the sufficient condition we assume that the components 
of the connection are symmetric in the lower indices, namely, 


Pe (2') = 


Voy(#) = Pr (#). (4.35) 


Let us consider a point at the origin of the coordinate system x". 
Furthermore, let us consider the following coordinate transformation 


1 
gl = gh + 5 Ante 5 (4.36) 


where A4, is an arbitrary constant matrix (only the symmetric part 
in the lower indices participates in this coordinate definition by sym- 
metry). This gives 
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ag 5H 
Cnr. 
O77! # 1 
x=0 


Therefore, under this coordinate transformation (see (4.22)) 


Do (2") 


Ox® Ox? Ox'¥” *" — Axk Ox" Ox'? Ox!” 


7 (= On On". 2s Og O¢* a) 
z'=0 
«z=0 


1 
Kono T A, (0) + 5 (AX, + AM.) 657 


1 
Toy (0) + 5 (Aev + Abe): (4.38) 


In other words, under this coordinate change the connection retains 
its symmetric form as it should. Furthermore, if we choose the trans- 
formation matrix such that 


5 (AB, + A) = PE, (0), (4.39) 


then, in the new coordinate frame we will have 


(0) 0. (4.40) 


Hence in this coordinate system we have 


dA Pal dr? A’ =0; (4.41) 


We note that if [4,(0) has an anti-symmetric part, (4.39) cannot be 
satisfied and it would not be possible to find a (locally flat) coor- 
dinate system with [4/(0) = 0. Therefore, in the study of classical 
gravitation, where we frequently make contact with a (locally) flat 
Minkowski frame, we assume that the affine connection is symmetric 
in its lower indices. 
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4.2 Christoffel symbol 


We have seen so far that a vector can be transported parallel to 
itself in an affine space if we know the components of the connection. 
We do not need the metric at all for this purpose. However, if in 
addition we also require the length of a vector to remain constant in 
the process, then the notion of the metric also has to be brought in 
(in defining length). As we will see this determines the connection 
uniquely in terms of the metric tensor. 

Let us take two vectors A” and V“ defined on a curve x"(7) pa- 
rameterized by the proper time 7. The scalar product of the vectors 
along the curve is defined to be 


Gy (a(7)) AB (a(7))V"(a(7)). (4.42) 


If we require that the scalar product of the two vectors remains 
invariant as we parallel transport them to neighboring points (if A4 = 
V¥ this defines the length of the vector), then we conclude 


es (Gull Ve") =U, 


dr 

dg dA¥ dv” 

= Aty”’ p= VP pA —— = 0. 4.43 
or, ae V" + Op ae V+ Ou an ( ) 


If we now use (4.13) for the parallel transport of the two vectors (as 
well as the chain rule of differentiation) we obtain 


dax* % dx* x » da 
O9u ae ABVY + Il, ee APVY + IwAl'TS, ae Ver= 0, 
dx v 2 dx P 
or, On Gu rs AUV + (govT + Guo Sv) as APYV = 0, 


dx* 
or, (Or.9uv a Jovl S, + Gel ox) ae APVY = 0, 
or, OnGuv + Jovl Sy, + Goul Dr = 0, (4.44) 


where we have used the symmetry of the metric tensor as well as the 
affine connection in the intermediate steps, namely, 
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Juv = Ips We ee (4.45) 


Furthermore, if we cyclically permute the indices A, u,v, we 
obtain 


Onur a Gar ie T Jol \, = 0, (4.46) 


Ov Or + Jopl 2 T Gord is = 0. (4.47) 
Let us add (4.46) and (4.47) which leads to 


On9ur + Ov 9 a 290. hv IP Jovl S, - Goud Dr = 0. (4.48) 
If we subtract (4.44) from the sum in (4.48), we obtain 


On Gur» + Ov Grp — OG + Worl ivy + Gol Sy, + Gopl ya 
—9Govl Sy, = Gopl Dr = 0, 


or, Onde O10Nen = ONGur 2goal iy = 0, 


1 
or, Mi, = 5 97" (Qudvr + BG — ArGu) (4.49) 


This is known as the Christoffel symbol and defines the con- 
nection as the unique function of the metric in (4.49). First of all 
we note from this definition that the connection or the Christoffel 
symbol is explicitly symmetric in the lower indices 


ene Bee (4.50) 


Secondly we had noted earlier that the connection can be thought 
of as describing the effect of curvature of the manifold which is due 
to gravitational forces. Hence the connection can be thought of as 
the potential for gravitation. But we now see that the connection is 
uniquely expressible in terms of the metric and hence we can think of 
the metric tensor as the true potential of gravitation. Furthermore, 
if we can find a (locally) flat Cartesian coordinate system, then the 
connection would vanish at that point since the metric tensor at that 
point has constant value. 
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> Example (Parallel transport in polar coordinates). Let us consider the line el- 
ement on the plane in polar coordinates given by 


dr? = dr? +r7d6”. (4.51) 
The covariant and the contravariant metric components for the space follow from 
(4.51) to be 

Grr = 1, goo = r?, g” = 1, 9” = S58 (4.52) 


We note that the metric components in (4.52) depend at the most on the radial 
coordinate r which leads to 


Ongur = 2rd,0dvr9a1; LV, A= r, 6. (4.53) 


Using this, we can now determine the nontrivial components of the connection 
from the definition (4.49) 
o 1 or 
Le = ~ 99 (Ougvr + Ov Gur _ Onguv), (4.54) 


which have the forms 


1 
ey = Tbr = Pape Ve =r. (4.55) 


These can, in fact, be compared with (4.25). 
The general formula for parallel transport (4.14) 


dA” =e, dx" A’, (4.56) 


in this case, leads to 


dA” =1'9d0A° = rA°dé, 
dA® =P%,drA° +19 doa” = ~2(A°dr + A'd6). (4.57) 


Setting dr = 0 we recover the result in (4.24). 
< 


> Example (Parallel transport in spherical coordinates). Let us consider the three 


dimensional space parameterized in spherical coordinates so that the line element 
is given by 


dr? = dr? + r?(d6? + sin” 6d¢”). (4.58) 
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It follows from (4.58) that the nontrivial components of the covariant metric 
tensor have the forms 


grr = 1, goo = 7", go¢ = 1° sin” 0. (4.59) 


Since this is a diagonal matrix the contravariant metric tensor is trivially deter- 
mined from the inverse to be 


rr 6 1 1 
g =1, g f=, reer (4.60) 


Let us note from (4.59) that only r and @ derivatives of the metric tensor 
can be nonzero and we can write (u,v,A = 7,0, ¢) 


Ou gu» = 2r Surdvedr0 + 2(r sin? 6 +r? sin O cos 9) 5,,051¢0r0- (4.61) 
Using this as well as (4.60) we can now determine the components of the connec- 
tion from the definition (4.49) which take the forms 

Ti, = 7(du0dv0 + sin” 6 d.65v4), 


Cas -= (Ourdve + Op0dvr) + sin 8 cos 06.65, 


1 
pee ue (Surdv¢ + Suedrr) — Cot O(d,0516 + Susdve)- (4.62) 


The parallel transport of any vector A” can now be obtained from (4.14) 
to be 


dA" = r(d6 A® + sin? 6.d¢ A®), 


dA’ = -: (ar A’ + do A") +sin@cos0d¢ A®, 


dA® = -= (ar A® + 49.4") — cot (a0.4* +494’) (4.63) 


< 


> Example (Parallel transport in cylindrical coordinates). Let us consider the three 


dimensional space parameterized in cylindrical coordinates so that the line ele- 
ment is given by 
dr? = dr? + r°d6? + dz’. (4.64) 


It follows now that the nontrivial covariant and contravariant metric components 
of the space are given respectively by 
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Grr = 1, goo = r, Gen 1, 
Te 00 1 22. 
g =1, ay gf St. (4.65) 


Comparing with (4.51) and (4.52), we recognize that this problem is quite 
similar to the analysis carried out in the polar coordinates. Although the prob- 
lem involves an extra dimension, since the metric gzz = 1 and the other metric 
components are independent of the coordinate z, the nontrivial components of 
the connection remain the same as in (4.55), namely, 


1 r 
Peg = boe= os V9 =7, (4.66) 


The parallel transport of a vector is now determined to be 


dA” =1%,d0A® = rA°dd, 
dA® =TegdrA® +16,d0A" = —2(A%dr + A'd0), 
dA* =0. (4.67) 


We note that the z-component of the vector does not change under a parallel 
transport because the connection vanishes along the direction z, namely, 


Viv =0, pv=r,9,z. (4.68) 


Let us next show that the Christoffel symbol transforms cor- 
rectly (see (4.22)) under a coordinate change. We note from (4.49) 
that under a coordinate change 


o o 1 o 
Dp = ae = =) a a (Og + Hu ~ HGiw) ‘ (4.69) 


Let us simplify the expression inside the parenthesis in (4.69) 
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/ of / ot / ot 
Od ae Ors _ OG 


Ox"! Oa , (02 O21 
= On Or” Or “DIA Jui +O, Ox “ADA Oylle 77 Dap 
1. € OGM Og 
~ Oa GeHe aw Senn 
Ax"! Oa™ _, Ox™ Or _, 
ar’ Aglr Iida Ox Oalt VO pA 


Vv 
OE OL Sy 
One Og OSes 


O77" Ox Or = Orx M1 
+ a Ox! *f Or oa! Juyr1 
O22 Arlt = Aa OR 
+ pees Ox'l ar Ox Ox’ Axl! ae Drip 
OA! Ox" orl = 82x" 
= os Or’ Ox | Gui 
Or" Ax™ Axl ac™ dal Ax"! 


Bx’ Ax Art mI + Fox Oxll Ox” 1 Or 


Oa! Ox’? Oar 
TB Bagh. Sept On Jur 


dx 02x"! O21 

+ 9g” (sa Onno + Im ae 
Ox") 8241 O2ylt 

+ Oa (aur Oaeagr Fev aie) 


orl Q2y™1 arn" 
T Opin In Ox!” Ox!» ~ Gurr OrPar” 


Ox! Ox! Oa 
Ox'# Ox'” Ox!r (Oy Guidi + Orr Garp ~ Orr. Jurvr) 
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da Pe” 
+29 ds Ox'® Ox!4Ox!Y 


Ox™ [Or Ox" 
Bi | Oe! Oply (Oya Guidy + On Dap — Ons Gyr) 


O27 x"1 


+24y4d4 Or orl’ 5 (4.70) 


where, in the intermediate steps we have simplified the parenthesis 
using relations such as 


O21 O72! 
Iam Oya — I” ByOqle 
O24” O21 


~ ad Bema — M0 Berea et) 


Using (4.70) in (4.69) we obtain 


o u or 
Ee, = 9 g (On + HI = AGiw) 


oy On? OB 55. Oa 
~~ 9 Oa Oar 7 Ox!» 


Ox) Ox") 
Or Ox (Ou Guida a On Orr p01 _ Or, Ips) 


Ox” 
TAG Barn Gas 


a Ox'® 
A2 Ox 


= Ox"! 


a1A2 


1 
=-5 6 


Ash On¥ (Qu urdr + O49 — Od. Jurvr) 


02 7"1 
vere aera 


1 on, [| Ox’? Oc Or" 
Siig = Aa oan (Dus Quid + 1 Dray — Or Jur) 
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Ox! 02% 
+ 29a Ox §Ox!#Ox!” 
Ox!? Ox? Ox"! Or'® OP x 


= Gig ae 
$ Ox" OzlE Axl” Phin Ar Axl¥Axrl”Y (4.72) 


This shows that the Christoffel symbol does not transform like a true 
tensor. However, the inhomogeneous term in (4.72) looks different 
from the one we have derived earlier in (4.22). The fact that they 
are the same can be seen as follows, 


_ Ox'? 02x az, 6) Ox'? Ox fe) Ox'?\ Ax 
Ox Ox'HOc” ~~ Oa! \ Aa Axlv Ox'# \ Ox } Ox'¥ 
_ 9a (57) Ox? O (dz" On* 
- Or’ <¥ Ox'! Axe \ Ox ) Oxl¥ 
2 ile: p Xr 2 alo r p 
= Ox Ox? Ox S O*x Ox* Ox (4.73) 


OxPOx Axl Ax” ~~ AxrOxP Oa! Aal”’ 


Therefore, the Christoffel symbol (4.49) transforms like the affine 
connection in (4.22) and we conclude, through the principle of equiv- 
alence, that once we know the metric tensor, the parallel transport 
of a vector is simply given by the relation 


dA? = 1% dat AY, (4.74) 


where we can identify 


ae = -5 a (Opgur a Ov Ou = OG) : (4.75) 

Let us end this discussion by discussing an example from me- 
chanics to show that the Christoffel symbol occurs very frequently 
in the study of equations of motion, although we may not recognize 
them. For example, if x’(t) and «'(t) denote the generalized coor- 
dinates and velocities of a classical, non-relativistic system, then we 
can write down the Lagrangian of the system as (a dot denotes a 
derivative with respect to t) 
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1 fats 


where we have set the mass of the particle to unity for simplicity and 
V(a;) represents the potential in which the particle is moving. This 
is a very general form of the Lagrangian. For example, for a particle 
moving in a given potential in two dimensional Euclidean space, the 
Lagrangian has the form 


1 
L=5 (4? +9") —V(z,y), (4.77) 
which in polar coordinates becomes 
(? + °6?) — V(r, 6) 


= 5 9j%'t! —V (ai), (4.78) 


Nle wle 


with the identifications 


goo= 1f, c= 6, 
gi = 1, go. =r’, giz = g2i = 0. (4.79) 


The equation of motion (Euler-Lagrange equation) for this general 
system is given by 


d OL OL 

dt 0% Oat 

d 1 Ogjk OV 

Bee Ae EIN whee as ee re PA Ag 
mt dt (9%) 2 Oxi we Ox? 0 

3 , Oi; 1 Ogjk OV 

Spd pe eT. AR p= es IIE GR EE 28 ss 

ws ge Oe oan Ox? a 


i 1 &. 
or, Gig? + (a0 ~ 35 ean) ‘e* = Fi, 
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ieee ce 
or, Gig? + > (Oegji + Aj Gin — Oigir) pa? =F; 


or, #+ 5 go (Ongje + O;9e% — O09gjx) He" = F", 
or olga =F (4.80) 


We note that if we write the generalized (non-relativistic) velocity as 


0s (4.81) 


then in the force free case, i.e., when F’ = 0, the equation of motion 
(4.80) takes the form 


duty da? 
dt jk dt 


j 
ae ey (4.82) 


which is nothing other than the equation for the parallel transport of 
the velocity vector u’ (see (4.13)). Namely, the equation of motion 
in this (force free) case simply implies that the velocity vector is 
transported parallel to itself along the trajectory «*(t). 


4.3 Covariant derivative of contravariant tensors 


We have seen how scalars, vectors and tensors transform under a 
coordinate change. We have also seen that the gradients themselves 
transform like covariant vectors under a change of coordinates (in 
fact, their transformation defines covariant vectors, see discussion 
following (3.11)). Let us now see how the gradients acting on various 
tensors transform under a general coordinate transformation. For 
example, for a scalar function (tensor of rank 0) we have 


Ort 
QyG(x) + O60") = Se Ou. 9(c), 
MEL nt Oa!# 
Moz) + OG'(x') = OM (a). (4.83) 
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Therefore, we see that the cogradient and the contragradient of scalar 
functions transform like covariant and contravariant vectors respec- 
tively. 

Let us next look at the transformation of the derivative acting 
on a contravariant vector (contravariant tensor of rank 1), 


o1 ML 
Oc AM (x) > O,AM(a') = ae Oo ee A” (2) 


~ Ox!e Ort 
Ox" = 2g Ox Ox! 
Ba) A es aay ed 
Ox'? acai Be Ox!" Oxt 0 1 (x) ( 8 ) 


Thus we see that the cogradient of a contravariant vector does not 
transform like a pure tensor. (This is what we had pointed out earlier 
in motivating the concept of parallel transport of a vector.) The first 
term which spoils the tensor nature is, however, reminiscent of the 
transformation property of the affine connection or the Christoffel 
symbol. Namely, we know that under a general coordinate transfor- 
mation (see (4.22)) 


I, (x) > TA (2’) 


On OT Om ax. O2a'# = Ax Aa 


Orta Oa? Ax ~ 71% © Ax Oam Aal® Oa ~ 485) 


As a result, we see that under a general coordinate transformation, 


TY, AX(2) 3 TK AY (e') 


o fOne On” O79” 24), Aa! Ax Oa 
~  \ Oat Axle Oa — 7 * Bar Aart Ax’? Bar 
Ax 
* Ope A” (x) 


I o1 2 a! LL O1 
nn ef Ole OEM ais Ox Ox re 
2 \ Axt1 Axle 71M © Oar Ax Ox!? 
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/L O1 
= Ox’! Ox pe 
Orla Axle — 71A1 


Qin O1 
One OM: 5. 


A™ 4 
a Ox" Ox Ox'? 


(4.86) 


Consequently, we note from (4.84) and (4.86) that if we define a 
generalized derivative of a contravariant vector as 


D, A" (x) = 0, A"(x) —T4, (x) A*(2), (4.87) 


it would transform under a general coordinate transformation as 


DAM (z) + DLA (a’) = OLA (2') — TK (2') A(z’) 


5 HORE - OF at M(x) Ore? on? 
— Oal® Ax Oa ‘i Ox!® Ox 


Oo, A™ (x) 


OER OR cites. ooh Ga OL! as 
~ Orla Ox!e Toad oe 0x71 Ox age A (7) 
Ox"! Ox'# ' 
= Se Sm (0AM (@) - TH @)A(@)) 
Ox! Ox! 
= Se Dn AM (a). (4.88) 


In other words, the generalized derivative of a contravariant vector 
defined in (4.87) transforms like a pure tensor under a change of 
coordinates. Such a derivative which has covariance properties under 
a transformation of the coordinates is called the covariant derivative 
of a contravariant vector. 

To understand the physical meaning of a covariant derivative, 
let us recall what it means to take the derivative of a vector in the 
Euclidean space. In Cartesian coordinates we have 


. OA 
A'(x + dx) — A(x) = dx! ae O(dz?). (4.89) 

x 
Namely, we parallel transport the vector A‘(a) at the point x to the 
point «+ dx and the difference between A‘(x + dx) and the parallel 
transported vector in the limit of vanishing separation defines the 
derivative. (As we have already seen, in the Euclidean space, parallel 
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AM (x) A*# (a + dx) 


At (a + dx) 


Figure 4.4: Parallel transport of a vector and the covariant derivative. 


transport of a vector in Cartesian coordinates is trivial in the sense 
that A*(2 + dx) = A'(x). The reason why parallel transport is 
necessary is because A“ (x + dz) — A“(x) is not a vector in general in 
a curved space since the coefficient matrices for the transformation 
of the two vectors at the two points will be different. On the other 
hand, A“(a# + dx) — A*#(a + daz) where A*“(x + dx) represents the 
parallel transport of A“(xz) to the point x + dz is a genuine vector 
and hence can be used to define a derivative.) 

If we now extend the above definition to a curved manifold as 
shown in Fig. 4.4, we see that the definition of the parallel transport 
leads to the definition of the derivative as (see (4.15)) 


A" (a + da) — A*#(x + dz) 
OA 
On” 


A¥ (x) + dx” — A(x) —I¥, da” A* + O(dz?) 


Al 
da” (= — ri,4") + O(dzx?) 


da” (0,A" —T4,A*) + O(da”) 
= dz’D,A" + O(dz’). (4.90) 


In other words, the covariant derivative is the natural extension of the 
derivative to a curved manifold. Furthermore, note that if gravitation 
is absent, the covariant derivative reduces to the normal derivative 
in Cartesian coordinates. This also gives a method for introducing 
gravity into a Lorentz invariant theory. Namely, we take a Lorentz 
invariant theory and replace the Minkowski metric by the metric of 
the curved manifold and the ordinary derivatives by the covariant 
derivatives. 
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Let us next look at the contragradient derivative acting on a 
contravariant vector which would transform as 
0° AY — 0’? A'¥(z') 
I / 
— Ox'? Oo" (= Al! )) 


Oxl1 


Ox'? Ora! Og? Og? 


= o102 1 (q) 
Ox" 0x72 0z!1 Ox™ = Oxtt 


a7 AM, (4.91) 


Again we see that this does not transform like a pure tensor. But 
we have also noted in (4.86) that under a general coordinate trans- 
formation 


d Me Arr 
yA = sec 


ORE OE ie ee HOGS ORF 
Oxl Ogle ~ PIA OxP10x Ozx!P 


AX (4.92) 


which leads to 


gw “x gee ae 
5 (OR? On? pig f OLE ODE aus 43 n O2x/k Oxi 
Ax7 One xl Ogle Pir OxP1 Ox Ox!P 


r 2 I 
= 5h! g71P2 Ox? re » a Ora'h a 
Ox71 P2 Oxli — PIA Ox?! Ox 


Io 2 ya! Lb 
Ox O*x 
Ox Oxi Ox 
Ox'® 2 gle 


Ral BaP A*. (4.93) 


ra) iL fa) Io 


Ox! Ox! 
= O1pi pei » 
renga) 


It is clear now that the generalized derivative 


D7 AH = (ara" = oT, A*) (4.94) 
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would transform under a general coordinate transformation as 


D? A(x) + D'? A(z) 


a ODP. va ies rl! Axl? Aall Sein 
Ox Ox°2 Ox* Ox! Ox 
a a ee 

= BA (arn 

7 sa = Dm AM (@). (4.95) 


This generalized derivative, therefore, transforms covariantly under a 
general transformation of coordinates and is known as the contravari- 
ant derivative of a contravariant vector. Furthermore, we note that 


D°AY = AM — g°T!, AP 


g’?d,AP — Th, A = o°? (0,44 — 14, A*) 


9°? D, A", (4.96) 


as we would expect. (In fact, the contravariant derivative can be 
simply introduced through the metric tensor as in (4.96) since each 
factor, g°° and (D,A“), transforms individually like a tensor.) 

This shows that in the case of curved manifolds one has to re- 
place the ordinary derivatives by covariant and contravariant deriva- 
tives. However, the exact form of these derivatives depends on the 
objects they act on. For example, for a scalar function (see (4.83)) 


Dyol(t) = On6(x), 
D'o(z) = O"d(ax), (4.97) 
since ordinary derivatives acting on scalar functions have covariant 


properties. However, for contravariant tensors, the covariant and 
contravariant derivatives are defined as, 
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D, Perle 2 0, The = ee Ie 
SPN i 3 hn PU et (4.98) 
and 
DI TH Hn = Ge Dees, (4.99) 


That these have covariant transformation properties under a general 
coordinate transformation can be easily checked. 


> Example (Leibniz rule). Let us recall that if we have a product of vectors 


(A" B”), this behaves like a second rank tensor under a general coordinate trans- 
formation. Therefore, the covariant derivative acts on such a product as in (4.98) 


D(A" B”) = 0,(A" B”) —T!,(A°B”) — TG, (A"B?) 
= (0,A") BY + A"(0,B") — ([0,A°)B” — A* (TG, B") 
= (0,A" —T A?) BY + A" (0,B” —TG,B°) 
= (D, A") BY” + A" (DoB"). (4.100) 
Namely, the covariant derivative satisfies the Leibniz rule for a product of vectors 


(tensors). 
< 


4.4 Metric compatibility 
As a particular application of the results of the last section, let us an- 
alyze how the covariant derivatives would act on the metric tensors. 
From (4.98) we see that 

Dygh? = dygt? — TY 97 —TS gt. (4.101) 


Furthermore, if we use the expression for the Christoffel symbol de- 
fined in (4.49), namely, 


1 
The = 5 I" (Onder + OaGvr — OIr«) - (4.102) 
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we see from (4.101) that 


1 
gr? Oy gh? ss gre (Oy gov az Ox Jur _ OvIXo) ge 
2 


1 o 
15 a (Oy gov a Ox Jur = OvIXo) g 
1 
= Oy gh? a 2 gh” (Oy gov a Oz Jur =: Ovo) ge 


1 oO 
+5 gh” (Ongve + OvGern — 99) 9°” 
= dxgM + gt” (Oygov) 97? 


= Ag? + g” (Dy (govg?") — govO.9"") 


= OyghP — dHAg™? = Aygh? — AygHP = 0, (4.103) 


where we have used the symmetry properties of the metric tensor. 
This shows that the covariant derivative of the contravariant metric 
tensor vanishes. Furthermore, since 


go Gov = Os (4.104) 


it follows that (the covariant derivative, like the ordinary derivative, 
satisfies the Leibniz rule, see previous example) 


Ds (oP? dp) = 0, 
or, (Dog) Jpv + GoD SG iy = 0, 


Oty. al Deg a — 0: (4.105) 


Since this is true for an arbitrary metric we conclude that 


DeGpv = 9. (4.106) 


In a similar manner we can show 
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D? gt’ =0, PG =O (4.107) 


In other words, the covariant derivatives of the metric tensor iden- 
tically vanish. (Metric tensor is covariantly flat.) It is a constant 
tensor in the absolute sense in a curved manifold. Of course, this 
can be seen intuitively in the following way. In a locally flat Carte- 
sian coordinate system the metric tensor reduces to the constant 
Minkowski metric. In such a frame the Christoffel symbol vanishes 
since the metric tensor is constant. (The other way of seeing this is 
that the effect of gravitation is zero in this frame.) In this frame, 
therefore, 


Dege =O 0): (4.108) 


Since this is a tensor equation, it must be true in all coordinate 
frames which leads to (4.103). 

Of course, we could have reversed this argument and could have 
demanded 


D,gt” = 0, (4.109) 


and this would have led to the unique choice for the Christoffel sym- 
bol determined in (4.49). It is for this reason that this condition 
(covariant constancy of the metric) is also known as the metric com- 
patibility condition. This simply means that the parallel transported 
metric tensor has the same value at «+dz as the metric at that point, 
namely, 


Guv(e + dz) = gu(a + der). (4.110) 


Of all possible spaces, metric compatibility selects out a special class 
of spaces that is relevant in the study of gravitation. Furthermore, 
let us note that the metric compatibility condition rests on the fact 
that we can find a locally flat Cartesian coordinate system free of 
gravitation. This in turn requires the connections to be symmetric. 
In spaces with torsion, i.e., in spaces where the connection is not 
symmetric, metric compatibility raises special questions which we 
will not go into here. 
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> Example (Christoffel symbol from metric compatibilty). Let us consider the 
metric compatibility condition (4.105) 


Do guy = 0, (4.111) 


which when written out explicitly has the form 


Oo Guv + Ton Gor +1 Er 9up = 9. (4.112) 


This is exactly the same relation as in (4.44) (with o © A). Therefore, following 
exactly the same steps as in (4.44)-(4.49), we can determine the Christoffel symbol 
to be 


o 1 o 
Daw = “9 g * (Ougur + Ov 9p a Or guv) o) (4.113) 


which is equivalent to saying that the Christoffel symbol can be determined from 


the metric compatibility condition. 
< 


> Example (Covariant derivative of a scalar density). Let us consider the deter- 
minant of a second rank covariant tensor 


A = det Ay, (4.114) 


which appears to be a scalar. However, under a general transformation A in 
(4.114) transforms as 


Ox® x8 
aa’ Only °F 


2 gee OON ae og, 3a OO ore (4.115) 
= e Oa! e pv >= Oe : 2 


Thus we see that A = det Ay, is not a scalar, rather it is a scalar density of 
weight w = —2 (see (3.31)). 

We can construct a scalar quantity from this determinant by multiplying 
(appropriate power of) the determinant of the metric tensor of the manifold, in 
this case as 


AvtS A= det Aj, = det 


A=(-g)*A, (4.116) 


where g = det gy, and A would transform like a scalar under a general coordinate 
transformation. Since A is a genuine scalar, it follows from (4.83) that 


OA = Di A= Dy (( g) A) = —g DA, 


or, g DuA = OA = On ((-g)‘A) ; 


or, D,A=g0n(g'A). (4.117) 
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Here we have used the metric compatibility condition (4.105) in the intermediate 
steps. This shows how we can define the covariant derivative of the determinant of 
any second rank covariant tensor (or for that matter any tensor density) through 
the use of powers of g. In particular, we note that when Ayv = gyv, (4.117) leads 
to 


Dug = 99n (g-*9) = 0, (4.118) 


consistent with metric compatibility (4.105). 


4.5 Covariant derivative of covariant and mixed tensors 


For the present, however, we note that since the metric is covariantly 
flat, raising and lowering of tensor indices commute with covariant 
differentiation. That is, for a covariant tensor of rank n, we have 


DoT yxy p12...1m = Dz (GiaiGiaws ag Gp 


= Imr9uave*** Jpnvy DoT". (4.119) 


This rule is of particular interest since we can now write down the 
covariant derivative of a covariant vector in the following way 


DieAy =D (ow) = Gabo A’ 
Sj (a,A” = Td") 


= Os (GuvA”) — (O59,) AY GaAs 
= OyAy — (Orgur)A* — uv 4 A. (4.120) 


To simplify this let us use the form of the Christoffel symbol in (4.49) 
which leads to 
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(Os Gur F Gilad) At 


1 
py ge (Os 9X a OG pa = >) A* 


= _ — Guv X 


1 
Oo Ipr =, 3 of, (Os 9rp oF OVI po = 2g) A* 


, oIGpdx — (Os9du + OGu0 = 2,aoa) A* 
1 Xr 

5 (Oo9ur + OuGro — OnGou) A 

1 

2 


(Os Gur ae On Dre O9op) gP Ap 
= -1%,,A, =—T?),Ay. (4.121) 


As a result, we can write the covariant derivative in (4.120) as 


Dg Ay = Og-Ay +T3,,A). (4.122) 


We can derive this also from the fact that for the scalar product 
of two vectors (which is a scalar) we have 


Dz (Ap BY") = 0, (A, B") , (4.123) 


which leads to (using the Leibniz rule (4.100)) 


(De Ag) BP EAg(D, BY) =05( AB), 


or, (DzA,) Be + Ay 0,8" =T4_B”).= 0, (A, B"); 
or, (DzA,) BY = Ca i P3,Ax) BH 
or, Dg Ay = OgAy +13), Ay. (4.124) 


That this behaves covariantly under a coordinate transforma- 
tion can be seen as follows. 
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Dz A,(2) > DLA! (2) = OL Al, + TA Aj 


Ox! Ox! 
= Bye On (Sa Ay) 


& OR VOI is, Oa Ax —) Ox? 


da™ Ox’? ale TMT Opoieea xl? Oxle ) Ox 
Ox*! Oxtt Ox) Ox 
= Or’? Or't O17 1 Ox!7 ol Ox! M1 
o 2 apt rx Co 
45% Op On 5. ree Orn 088 Oa age . 
At Oa! Agi THAT? Ag Ag Oa! Aal#& Axle? 
(4.125) 
To simplify this expression we use the relation 
Og": Gare» 
nar gor = oN, (4.126) 
which leads to 
fa) Ir ra) A2 
On ae a = 
Ox Ox!r 
2c Oar? = Ax!» Ox? 
Bar aae OaxX * Ou CO (Sr) ee 
2 IX dA2 A2 Ir 
ee ee i oy 2) ee (4.127) 
; 02 0a OalA * \ dc" J Oxia 


Using this relation in (4.125) we obtain 


Do A(x) + DA‘, (2') 


Ox™ Oat Ox” Ort 

= Due Opn Cdn + aa On | aa | Am 
Ox'® Ox'l4 Ox Ox! 
Ox™ Oat 


nt. A 5 Ox? \ Oa! Oa Ox™ 
Oxle ale TUM 1 \ aX] Ozer Oxi ale? 
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6 Hele 6 i 
= Fe Fae (8m Ain + Poin An) 
Og” og Cg) Ox? \ 
+ Gq Ons (Gr) Aus ~ Bare Ons (SE) Ads 
Or?! Og" 
= Oxle Dale De, Au: (4.128) 


Similarly, we can show that 


DA, 


gD A, = 9°? (Ay ais T),Ax) 


0° A, + 97°T Ay, (4.129) 


also transforms covariantly under a general coordinate transforma- 
tion. We can now write down the covariant derivative of a covariant 
tensor of rank n as 


=. mr r 
DoT p12... = Oo Thuy...tun a Pop 2 te... tn ox PS pot tt d..-in 


eal fa ciig2d hs (4.130) 


olin 


The covariant derivative for a mixed tensor can likewise be shown to 
correspond to 


Ds TH. ee: ae 
= Oped OEE re oe ~~ I ys = i IAG aia aR 
HA, Pein ta FD Te, (4.131) 


4.6 Electromagnetic analogy 


We have seen that in dealing with gravitation and hence with curved 
manifolds, we have to introduce the notion of a connection in order 
to define the covariant derivative (of tensors). This is deeply con- 
nected with the fact that there is a local gauge symmetry associated 
with gravitation which we recognize as general covariance (invariance 
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under a general coordinate transformation). A similar situation also 
arises in flat space-time in the study of gauge theories. For exam- 
ple, if w(a) represents the quantum mechanical wave function of a 
charged particle, then we know that under a local Abelian gauge 
transformation, the wave function transforms as 


W(x) 3 U(x) = eM ab (2). (4.132) 


Under this gauge transformation we find that 


Aya) > Ay (2) = d, (e*Y(2)) 


= ~ie(d,a(a))e° (a) + eA, a(x), (4.133) 


so that the expectation value of the momentum operator is no longer 
invariant under this transformation (the derivative of the wave func- 
tion does not transform covariantly under the gauge transformation). 
On the other hand, if we require physical results to be invariant under 
such a gauge transformation, we find that invariance can be restored 
if we assume that there exists a connection A,,(x) (vector potential) 
which transforms under the gauge transformation as 


A,(t) > Ai,(x) = Ay + O,a(2), (4.134) 


and if we generalize the ordinary derivative to a covariant derivative 
of the form (recall minimal coupling) 


Dyab(x) = (Oy + ie Ay(x)) b(a). (4.135) 


In this case, under the gauge transformations (4.132) and (4.134) 
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Dyih(e) > Dirh!(x) = (Ou + teA,, (2) W(x) 
= 9,0! (x) + ieAl (x)! (2) 
= —te(A,a(2) eo) + eA, ah(2) 
+ie(Ay(x) + O,a(x))e7%e w(x) 
eM) (8, + ieAy(#)) (2) 
= Ale) DF b(a). (4.136) 


Namely, the covariant derivative of the wave function transforms 
covariantly under the Abelian gauge transformation. We note here 
that the connection A,,(x), in this case, is nothing other than the 
electromagnetic potential (1.83) or the Maxwell gauge field and the 
presence of the gauge field or the connection is intimately connected 
with the local gauge symmetry associated with electromagnetism. 
Let us also note from (4.134) that the connection (gauge field) does 
not transform covariantly under the gauge transformation because of 
the inhomogeneous term much like the transformation of the affine 
connection in (4.22). 

Similarly in the case of gravitation (curved manifold) we note 
that the presence of the connection signals the presence of a local 
gauge symmetry in the theory which corresponds to the general co- 
ordinate invariance of the system. Furthermore, this also suggests 
that Einstein theory, which describes gravitational forces, must have 
the nature of a gauge theory and we will study this later in the 
course. 


4.7 Gradient, divergence and curl 


Although the derivatives in the case of a curved manifold have to 
be generalized to covariant derivatives, there are special cases where 
they take simple forms. We have already seen in (4.97) that for a 
scalar function 


D,¢(x) = O,¢(2), (4.137) 
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so that the gradient of a scalar function remains unchanged in a 
curved manifold. Furthermore, we note that the covariant derivative 
of a contravariant vector is defined as (see (4.87)) 

DyA =O, Ak ST igh (4.138) 
Therefore, we can define the (covariant) divergence of a contravariant 
vector as 

D,A* = AP. = O,AY —THA’, (4.139) 


where a semi-colon conventionally denotes covariant differentiation. 
To simplify this expression let us recall that 


1 
age = ~5 gee (Ongop + O69ou — OpGJuc ) 
1 
=> ~ 5 (0? gop as OF Os Goes _ OF one) 
1 up 
= "9 g Os Go (4.140) 
Let us now use the following simplifying relation. If A denotes 


any (square) matrix, then we note that we can represent its deter- 
minant as 


det A = eT A, (4.141) 


This is obvious if the matrix A is diagonal, for in this case 


det A= |] Ai, (4.142) 


where the diagonal elements A; represent the eigenvalues of the ma- 
trix and we have 


Yond; 
etna — @%i =[[>i=aet 4, (4.143) 


a 
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which establishes the equivalence in (4.141). If A is not diagonal, on 
the other hand, we can write 


A=SApS", (4.144) 


where Ap is the diagonalized form of A and S is the similarity trans- 
formation that takes A to its diagonal form. In this case, we have 


einA _ ,Trln(SApS-!) _ (Tr(InS+nAp-WS) 


el*nAp — det Ap = det SApS~! = det A. (4.145) 


This proves the formula (4.141) in general. If we further assume 
that the matrix A depends on space-time coordinates we can take 
the derivative of the determinant of the matrix with respect to coor- 
dinates to obtain 


d,(det A) = O,e™™4 = 8,(Trln A)e™™4 
Tr(0, In A) det A 
Tr (A~*0,A) det A. (4.146) 


Let us apply the relation in (4.146) to the determinant of the 
metric tensor (metric tensor can be thought of as a 4 x 4 matrix in 
four dimensions), which leads to 


O59 = O05 detiguy = (g"" Ocguu) G; (4.147) 


where we have identified g = det g,,,. Therefore, we have 


foes: l — (—g) pV 
On g 2./—9 (—059) — 2./=9 g Os Oo, 
= a gh!” Do Guys (4.148) 


so that comparing with (4.140) we can write 
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1 
T= = g!? OsG5u = — 


1 
s OoV=9. (4.149) 


The divergence of a contravariant vector now follows to be (see 
(4.139) ) 


D,A* = 0,A"—TH,,A° 


= 8,A + = (Op V=9) A” 
= = On(V—g AP). (4.150) 


This leads to an invariant formulation of Gauss’ theorem in 
a curved manifold. Namely, if the vector field A“(a) vanishes at 
infinity, then 


[ vaiate Dat = fate Vp Te =H") 


[ae On(/—g A“) = 0. (4.151) 


A particularly simple application of the expression for the covariant 
divergence in (4.150) arises when the vector field itself corresponds 
to the gradient of a scalar function. In other words, 


D,,D"¢(a) = D,,(0"¢(2)), 
oi 
V-g 


where U stands for the invariant D’Alembertian in a curved manifold. 
We note here that much like the covariant derivative, the exact form 
of the D’Alembertian depends on the space of functions on which 
it acts. However, from (4.152) we see that acting on the space of 
scalar functions, the invariant D’Alembertian can be identified with 
the operator 


or, (2) = On(/—gO" d(x), (4.152) 
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On /=G gt” Op. (4.153) 


1 1 
= — 0,,/-g 0" = — 
v-g " V-9 
Let us apply this result to the simple case of three dimensional 


Euclidean space in spherical coordinates with the line element given 
by 


ds” = dr? + r?(d6? + sin? 6d¢?). (4.154) 


In this case, we can read out the nontrivial components of the co- 
variant metric tensor to be 


Grr = 1, goo = 1", I¢¢ = r’ sin? 8, (4.155) 


and the inverse (contravariant) metric tensor is given by 


1 1 
g=1, gG@=5, 9% ==, (4.156) 


which leads to 


g = det Gi3 = Grr 900966 = r* sin? 6. (4.157) 


Thus using (4.152) we see that, in spherical coordinates, the Lapla- 
cian (analog of the D’Alembertian in three dimensions) acting on 
scalar functions becomes 


1 
r2sin0 


V’7b = Ga 0; (/99'7 9; ®) = 0; (r? sin 0g’ 0;®) 


oy cat [a (r? sin 0g""0,) + Op ie sin 0g°0p) 


r? sin 0 
+0x [e sin 0g°%0,)| ® 


Sh) [ac sin 0,) + Op G sind 5 ay) 


r2 sin @ 
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1 
+ Og (sino ad as) ® 


sin? 0 


Or : Or ane 00 ane 00 sino Og? 


(4.158) 


[ers 1 0 a) 1 | 


so that, in this case, the form of the Laplacian (acting on scalar 
functions) in spherical coordinates has the familiar form 


2 
a apse ry eee 


dr! Or | sin000- 00 am (4.159) 


It is worth noting that the divergence of some of the tensors 
also takes a particularly simpler form. For example, we note that for 
a second rank tensor 


DP Oy Pel pe pe 


1 
= OT" + T— (BaV=9) TY — Vig T'? 


1 VY Vv Oo 
2 OO era a ee AB (4.160) 


In particular, if the tensor is anti-symmetric, namely, 


THY = TH, (4.161) 


then the second term in (4.160) vanishes (because of the symmetry 
of the Christoffel symbol in the lower indices) and we can write (for 
TH = —T") 


1 
DT = —— 0 (afag hk): (4.162) 
LU J/g LM 
This result generalizes to any tensor that is completely anti-symmetric 
in all its indices. 
The covariant curl of a covariant vector takes a particularly 
simple form. For example, we know that (see (4.122)) 
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Dy Ay = OA, + TA, Ay, (4.163) 


from which it follows that 


DyAy — DyAy = Avi = Any 


0,Ay, +T3,A — 0,4, —TA,Ay 


= 0,A,—d,Ay, (4.164) 


where we have used the symmetry of the Christoffel symbol (4.50). 
This shows that the curl of a covariant vector remains unchanged 
in the presence of gravitation (or in a curved manifold). However, 
it is worth emphasizing here that this follows only because we have 
assumed the connection to be torsion free (the connection is symmet- 
ric). In the presence of torsion, this relation would modify since the 
connection will have an anti-symmetric part as well. In the present 
case we note that the curl of a covariant vector is a second rank 
anti-symmetric tensor, 


Aw = DyAy — Dy Ay = 8,Ay — 0,Ay = —Avps (4.165) 


which does not involve the Christoffel symbol at all. 
Let us next note that for a covariant second rank tensor which 
is anti-symmetric A,, = —A,, (see (4.130)), 


Aia% => Dy Aw — O)\ Ap + PS Aov + Ate 


OA = PS Ave or 5 Abies (4.166) 


where we have used the symmetry property of the Christoffel symbol 
as well as the anti-symmetry of A,,, and have introduced the conven- 
tional notation of representing a covariant derivative by a semi-colon. 
Taking cyclic permutations of this relation we obtain 


DyAvyy = OpAvrys —ThyAre +T5,,Ave, 


Dy Ap = Ov Axp _ Loe + Pe (4.167) 
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Taking the sum these three relations we obtain 


Dj Apy - Dy Ava Di Ax 
= Ay —T§,Ave +10) Apo + OpAva — 1%, Arc +15, Ave 
+0yAyy, — U2) Apo +1%,Ayo 
= OyAw + OuAvy, + WAyu- (4.168) 


That is, in this case all of the effects of gravitation drop out. This 
is particularly interesting since it implies that a relation such as the 
Bianchi identity in the case of electromagnetic theory remains un- 
changed in the presence of gravitation (or in a curved manifold). 
Namely, 


which is the Bianchi identity in flat space. 


> Example (Nijenhuis torsion tensor). As we understand now, the ordinary deriva- 
tive acting on tensors in a curved manifold does not transform, in general, like a 
tensor under a general coordinate transformation. We need to use the covariant 
derivative for this purpose. However, we have also seen in (4.164) and (4.168) that 
under special circumstances, the connection (Christoffel symbol) in the covariant 
derivative drops out and the combination of ordinary derivatives indeed leads to 
a true tensor. Nijenhuis torsion tensor is one such tensor in a curved manifold. 

Let S! be a mixed tensor of rank 2 with one covariant and one contravariant 
index. Using the covariant derivative (4.131), we can now define 


Na, = S2DoS> — S2DeSp — S2(D,S° — DL S2) = —N2,, (4.170) 


which by construction is a mixed tensor of rank 3 with one contravariant and two 
covariant indices. 
If we write out the tensor (4.170) in detail, we obtain 


NS SE (yS) BTS 5S) = TA8S) = SoS Te STS Se) 


— $3(0,87 +1889 —17,8°) + $3(d,S2 +1%,,8° —T%,S2). (4.171) 


We note that if we use the symmetry property of the Christoffel symbol (4.50), 
all the terms involving the connection cancel pairwise and we have 


NY = —N2, = 828.8) — 820,58) — S3(0.82 — 0.82). (4.172) 
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Even though the tensor (4.172) involves only ordinary derivatives, it is a 
true tensor. This is known as the Nijenhuis torsion tensor associated with the 
mixed tensor S! and plays an important role in the study of complex manifolds 
(as well as in integrable systems). 

< 


Therefore, we see that if A, is a covariant anti-symmetric ten- 
sor of second rank, namely, 


Aw = — App, (4.173) 


then we can construct an anti-symmetric tensor of third rank from 
A,» independent of the Christoffel symbol (this would correspond to 
taking the curl of the second rank tensor), as 


Appr = Dy Aw + Dy Ap + Dy Ax; 
Adj toa, (4.174) 


These results suggest the following general fact. Given a covariant 
rank n anti-symmetric tensor, we can form a unique covariant rank 
n+ 1 anti-symmetric tensor which does not involve the Christoffel 
symbol (by taking its curl). The general result can be quickly proved 
with p-forms which we will not get into. Here let us study this only 
for a third rank tensor. If A,,,, is a completely anti-symmetric tensor, 
then 


Dy Ayr = Op Aww + Pou Aovr + ey Auor + Py Awe 
= Op Air + Pow Avro + Py Anus + Py Ape- (4.175) 


Taking the cyclic permutations of this relation we obtain 


Dy Avxp = On Avxp + Pv Arpe + Pr Apve + Pi pAvro> 
DyA)ou = OvA) pu a Pon A pue + Po oAprs + Po uArpo> 
D)Apw = On App + PS pAuve + PS pAvpe + ipsipe errr (4.176) 


It follows now that the totally anti-symmetric combination 
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Aprp a De Agex _ DpAyxe + DyAypu _ Dy Apyy 
= Op Avr _ OnAvrp + OvA) pu = OVA giv (4.177) 


which is, of course, a covariant anti-symmetric tensor of rank 4 in- 
dependent of the Christoffel symbol. 

Some comments are in order here. First of all we note that we 
can increase the rank of a covariant anti-symmetric tensor by one 
simply by taking its curl. If we try to apply this procedure twice, 
then it yields a tensor which vanishes identically. For example, we 
note that given a scalar function, we can construct a covariant vector 
by applying the covariant derivative (curl) 


Ay = Dud = On. (4.178) 
If we try to construct an anti-symmetric second rank tensor from 
this by taking its curl, then we find 

Day = Dy Age = Oho Oy An 

= 0,0,¢— OO,b = 0. (4.179) 

Let us next consider a covariant vector A, and construct the second 
rank anti-symmetric tensor by taking its curl as (see (4.164)) 

Ay = DyAy— DyAy.= 0; Ay —. Op Aje (4.180) 
If we now try to construct an anti-symmetric tensor of rank 3 by 
taking curl once more, we have (the square bracket represents anti- 
symmetrization) 

Aid] = Dy Aw + DyApr + Dy Ady 


= Orn Aww ae On Avr oF OpAXa 


On (Oy Av — OA) + Ou(O,Ay — O,Ay) + Oy(O,Ap — Oy An) 


= 0, (4.181) 
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where we have used (4.168). We can show that this result is true in 
general and the general result follows in a simple manner with the 
use of p-forms that we will not go into. Furthermore, let us note that 
in n dimensions, the highest rank nontrivial anti-symmetric tensor 
would have rank n. This is easy to see because if there exists an 
anti-symmetric tensor of rank n + 1, then two of its indices have to 
be the same in which case it will vanish identically. The same is true 
for any higher rank tensor as well. 

Let us also note that in n dimensions any anti-symmetric tensor 
of rank n must be proportional to the generalization of the Levi- 
Civita tensor to this space. This can also be seen simply because 
if 


Die. din] (4.182) 


is the anti-symmetric tensor (anti-symmetrization is denoted by the 
square bracket), then the combination 


er, (4.183) 


1---Ln]? 


is a scalar function (density). Therefore, we can always write 


Dijias.sten] = Ey ...un o(x), (4.184) 


where (x) denotes a scalar density. 
Let us define the following terminology. An anti-symmetric ten- 
sor with the property 


DpyAw...) = 9; (4.185) 
is called closed. (Namely, the covariant curl of the tensor vanishes.) 


Furthermore, an anti-symmetric tensor Aj,,,...| is called exact if it 
can be written in terms of a lower rank (anti-symmetric) tensor as 


Aj...) = Dy Fp.) = Owl ...1- (4.186) 
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Thus for example, in the case of three dimensional electrostatic in 
Euclidean space, as we know, one of Maxwell’s equations takes the 
form 


VxE=0. (4.187) 


Therefore, the electric field can be thought of as a closed vector and 
since we know that it can be written as the gradient of a scalar 
potential 


E=-V4, (4.188) 


it is also exact. 
Another example follows from the definition of the electromag- 
netic field strength tensor itself (see (1.83) and (4.164)) 


Fy = DyAy — Dy Ay = On Av — OAg, (4.189) 


where A,, denotes the vector potential. Thus we see that one can 
think of the electromagnetic field strength as an exact tensor. Fur- 
thermore, we know that the field strength tensor satisfies Bianchi 
identity (see (4.168)) 


Dy Fup a DuFinr + DUP yu = Ovo Fy i Onl vd + OnE yi 
= 0O)(O,Ay — Op Ay) + Oy(O,A, — O,Av) + O,(O,Ap — OnA3) 
= 0, (4.190) 
so that it is also closed. These observations lead to the following two 
theorems. 
Theorem 1: Every exact tensor is closed. 


It follows from the definition of an exact tensor in (4.186) that 


Tys..-tin] = Oya T (4.191) 


H2...fn]]* 
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Therefore, since derivatives commute (their product is symmetric) 
we have 


Ding Myr-pnll = pings Thpa-pnall = sens eer Mra...tenll] = 9 
(4.192) 


which proves the theorem. (This is the same result as we had dis- 
cussed earlier, namely, that taking the curl twice gives zero.) 


Theorem 2: Every closed tensor is at least locally exact, 1.e., it 
admits a tensor potential locally. 

We will not prove this theorem here but simply study its con- 
sequences in four dimensions. 
Theorem: In four dimensions a second rank anti-symmetric tensor 
being closed is equivalent to its dual having zero divergence. 


Let F,,, be a second rank anti-symmetric tensor in four dimen- 
sions. Then it is closed if (see (4.185)) 


Dy Fup + Da Fox + DIP yy = OF py + Onl on + OP yy = 0. (4.193) 
Furthermore, the divergence free nature of the dual tensor (see (1.81) ) 


is defined as (any factor of ,/—g commutes with the covariant deriva- 
tive and, therefore, can only be a multiplicative factor) 


~ 1 i 
Die = Dy, (5 oF) = DED 


1 1 
= 5% aoe (DuP yp + Dy Foy + DpFux) = 0. (4.194) 


This is known as the Bianchi identity and as we had noted earlier in 
chapter 1, the two relations (4.193) and (4.194) are equivalent. 


CHAPTER 5 


Geodesic equation 


5.1 Covariant differentiation along a curve 


Sometimes a vector field or a tensor field is defined only on a certain 
trajectory rather than on the entire space-time manifold. Thus for 
example, the four momentum or the stress tensor or the spin vector 
of a particle can be defined only along the trajectory of the particle. 
In this case we have to develop the notion of covariant differentiation 
along the curve. 

From what we have seen already in the last chapter, the co- 
variant derivative measures the change in a vector or a tensor in an 
absolute sense. Let us apply this notion to a vector €“ defined along 
a curve z(T) so that we can write 


EM (a(r)) = &*(r), (5.1) 


and at an infinitesimally separated point on the curve we have 


dé" (r) 


re O(dr?). (5.2) 


é(r + dr) = €"(r) + der 
On the other hand, we have defined the parallel transport of a vector 
€#(r) to the point 7 + dr as given by (see (4.15)) 


dx” 


e €*(r)dr. (5.3) 


er deer) = Ea) 


From (5.2) and (5.3) we obtain 
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=) OT pede eh y ar )dr + O(dr*) 


dé" (r) 
dr Ud dr 


dé" (7) m de 3) 2 
= dr F dase rs €“(r)dr + O(dr*) 


dg" da” 
- ar( oo —-™, — 7) + O(dr?) 


DE&H (7) 


arene Dr 


+ O(dr?), (5.4) 


which defines the covariant derivative of the vector €“ along a curve 
to be (see also the discussion around (4.90)) 


Dé" dé" da” *.:5 
oe VE a ees Bey Picea 
Dr dt vv dr é 
da” Og! dx’ 
= pe eee 
dr Ox’ vr dr é 
dx” 
= Bh pe ¢X 
dr (ag Ps ) 
SMe yee (5.5) 
drt 


Comparing with (4.12) we recognize that the parallel transport of a 
vector along a curve can also be written as 


DEM. dik tie 
Dee oe (5.6) 


Similarly, the covariant derivative of a contravariant tensor field 
along a curve can be expressed as 


DTM da” 
ee eee (5.7) 


We note from (5.7) that 
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Dgt” dar 

=. Dor = 0h 5.8 
Dr dr a oo) 
which follows from the fact that the metric is flat under covariant 
differentiation (see (4.103)). Since g,,, denotes the inverse of the 
contravariant metric tensor g’”, it follows that 


Xr 
= =~ Paguy = 0. (5.9) 


Using these, we can now derive 


Fee O Guek) = Sue (5.10) 
Similarly, for an arbitrary covariant tensor we have 
Se Tnswtn) = G-(Ghrva GanvyT) 
D 
= Jur + Gunn Bo ee Vs (5.11) 
and so on. 


5.2 Curvature from derivatives 


So far we have talked about the metric and the connection as rep- 
resenting the effects of gravitation. However, we have also said on 
several occasions that gravitation produces curvature in the space- 
time manifold. But, we are yet to define and identify the curvature. 
We will do so now in two different ways before studying its properties 
in detail in chapter 7. 

First, let us go back to the example of electromagnetism in sec- 
tion 4.6. We have noted earlier that invariance under the U(1) gauge 
transformation requires that the ordinary derivative be replaced by 
the covariant derivative defined as (see (4.135)) 


Dyth = (Ou + 1eAy)b. (5.12) 
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On the other hand, the first property that we notice from the defi- 
nition of the covariant derivative is that unlike ordinary derivatives, 
the covariant derivatives do not commute. Namely, 


[Du, Di|v = (DuDv ae D, Dy) 
= [(0, +%eA,)(0, + teA,) — (0, + teA,)(O, + 1eA,)] 
= ie0,(ALW) + ieA,O,p — ied, (App) — teALOLW 
= ie(0,A,)p + ieA, Op + ieA, Opp — te(AAp)p 
—ieA, Ow — teA,O,Y 
= ie(0,A, — Oj Ay) = ieFYy, (5.13) 


where Ff, represents the electromagnetic field strength tensor. The 
non-commutativity of the covariant derivatives arisesbecause of cur- 
vature in the space (in this case the Hilbert space) and we identify 
the observable curvature of the space with the field strength tensor 
Ps 

Similarly, we can derive the curvature in the case of a gravita- 
tional field (or a curved manifold) by taking the commutator of two 
covariant derivatives. We have defined the covariant derivative of a 
contravariant vector as (see (4.87)) 


D,A? = 0,A? —T2,A”. (5.14) 


It follows from this as well as from the definition of the covariant 
derivative for mixed tensors that (see (4.131)) 


D,D,A® = 6, (D,A°)+T),,D)A® —T8, DA 
= 0,(0,A? —T%,A) +1}, (A? — ThA’) 
—T?, (0, A* — 1, A”) 
= 0,0,A? — (0,1%,)A” —T,0,A” 
+ 1), (0,4? — 14, A7) — 1%, (0,A* — TA”). (5.15) 


Interchanging ps © v, we obtain from this 


DiDyAr = OpoyAP —{O,1 8 JAP le Ar 
se (OAPs Th AP) 1" (OAS = 1) AM) (5.16) 
and these results lead to the commutator of the covariant derivatives 
acting on a contravariant vector as 
[Dig DAS = (Dy DyA? = Dy DA") 
SOT AO ATT hie al a) Ae 


= (O1%e— 0,03, + Te lop Te ri )A" 


LL 

= PA’, (5.17) 
where we have identified 

Rou = OT G5 — Onl, +P Me — To Tio: (5.18) 


This is known as the Riemann-Christoffel curvature tensor of the 
manifold. Similarly, for a covariant vector we can show that 


[Du,DrJAp = Dyu(DyAp) — Dy(DyAp) 
= Oy(DyAp) +1), Dy Ap +P, Dy Ay 
—0,(DyAp) —T),,D,Ap —T), Dy Ar 
= OH (VA, =i? Lean) + i (Q,Ay + 1) Ac) 
—8, (OyAp + 19,40) — 12, (QnA, +12, Ac) 


lo o A Po A fro 
= 5 |ar?, > OW ep FP Yl — Pip “al Ag 


= —R° Aa. (5.19) 


Oo 

pp 
Furthermore, note that since the curvature tensor is a function of 
the connection and its derivative, if the space were flat the curvature 
tensor would vanish and then the covariant derivatives would reduce 
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to ordinary derivatives and they would commute. (Since curvature 
is a tensor, if it vanishes in one coordinate system, say the Cartesian 
coordinate system, then it will vanish in any curvilinear coordinate 
system.) 


> Example (Commutator of covariant derivatives on a tensor). Let us study how 
the commutator of two covariant derivatives acts on tensors, in particular on a 
second rank tensor T“”. From the definition of the covariant derivatives in (4.98) 
and (4.131) we have 


Dept SOD, Fs DP SP Dr Se ae, 


DID Tt =O ,(DT"'\h0%, DT STR DT! STDP ™ G20) 


Therefore, we obtain 


[Dx; DelT*” =O(D,T") +15 ,DeF” —TR, DT?’ —Th DoT?” 
— 0,(D,T*”) +13, DoT” —T#,D,T°’ —T8,D)T” 
= 0,(0,T —T4,7°” —1%,T"7) 
~~ eve (O,T°” —- ey Uae a. Doge”) 


—Tyo(0,T"? —Th T°?” — TTY") — (9p & A) 


=(0,05 Otte Ee eae 


t (Opole — Ooo +P inP Go — Mon YT? 


= Rt 


oXp 


Meee oe isa (5.21) 


We see that when the commutator of two covariant derivatives acts on a tensor, 
it leads to a sum of terms involving the Riemann-Christoffel curvature tensor. 
< 


> Example (Curvature for a two dimensional manifold). Let us consider the two 
dimensional space defined by the line element 


eS a(ae — da?). (5.22) 


The nonzero components of the metric tensor follow from (5.22) to have the forms 


1 1 io 
gtt = p Geax = 927 g” = ts Ig = -0. (5.23) 


The metric tensors depend only on the coordinate t and, therefore, the nontrivial 
components of the connection (4.49) 
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oO 1 co a 
Py = ~9 g . (Ougva + Ov Oru = OnGur) > BYUo= t, 2, (5.24) 


can be determined to have the forms 


1 1 1 
Poe Ss. Pee Pe eS Pea 2 
tt t ’ Lx t ’ tx xt t ’ (5 5) 


so that the geodesic equations are given by 


¢—',tée —T%,at = 0, 
fee 
or, &— zit =0. (5.26) 


In two dimensions there is only one nontrivial, independent component of 
the Riemann-Christoffel curvature tensor (5.18) which we denote as R’,;, and it 
has the form 


Rix = Onl = OTe a rah. ae Reales 


11 11 
see deat En Se, (5.27) 
Et tt 


It follows now that 


Rtatz = GP = = 


5.3 Parallel transport along a closed curve 


Let us next discuss the notion of curvature from a second point of 
view. To differentiate between flat space and a curved space, let us 
study the example of parallel transport of a vector along a closed 
path. First, let us consider a closed curve in flat space as shown in 
Fig. 5.1. If we take any vector and parallel transport it along the 
curve keeping it always parallel to itself, then when we come back 
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<a> 


Figure 5.1: Parallel transport of a vector along a closed path in flat 
space. 


to the starting point the final vector would coincide with the initial 
vector. This is the characteristic of a flat space. 

Let us next consider the surface of a sphere. If we move a 
vector from the north pole Q along the closed curve QPRQ always 
maintaining the vector locally parallel (always facing the south pole) 
as shown in Fig. 5.2, then when we reach the starting point (the 
north pole), the final vector would be pointing in a different direction 
from the initial direction. This is a characteristic of curved spaces, 
namely, when a vector is parallel transported along a closed curve 
it does not come back to itself. The angle between the initial and 
the final vector depends on the closed path. Therefore, the change 
in the vector must be related to the curvature as well as the closed 
path along which it is transported. 


Q 


é 


Figure 5.2: Parallel transport along a closed path on the surface of 
a sphere. 


From the definition of parallel transport in (4.14) we know that 
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when a vector is parallel transported infinitesimally in a curved space 
the change in its components is given by 


dé" (x) = I, (x)daé(z). (5.29) 


However, since the value of the connection is different at different 
points in a curved space-time manifold, the change in a vector parallel 
transported from point A to point B along path 1 would be different 
from the change along path 2 (see Fig. 5.3). The other way of 
saying this is that a vector parallel transported along a closed curve 
would not come back to itself. Therefore, the change in a vector 
around a closed path would be a measure of the curvature of the 
manifold. 


Figure 5.3: Parallel transporting of a vector from A to B along two 
paths. 


> Example (Parallel transport on the surface of the unit sphere). Let us consider 
a vector A® = (A®, A®) on the surface of the unit sphere with components (1, 0) 
at the coordinates (6 = 00, ¢ = 0), namely, 


A“ (9 = 60,6 = 0) = (A*(60,0), A®(40, 0)) = (1, 0). (5.30) 


We would like to calculate the components of this vector when it is parallel trans- 
ported along a circle at 0 = 00 and, in particular, when it is parallel transported 
back to the starting point. 

The line element on the surface of the unit sphere is given by 


dr? = d6? + sin” 6d¢’, (5.31) 


This is the same as the line element studied in (4.58) with r = 1,dr = 0 and, 
therefore, we have 
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goo =1, gos = sin’ 6, (5.32) 


and we can carry over the nontrivial components of the connection from (4.62) 


(vir vanishes in this case since g,, = 0) 


Too =sinOcosd, I$, =T%, = —cotd. (5.33) 
At this point there are several ways to solve the problem. Let us indicate only 


two methods. 


Method 1: The equation for the parallel transport of the vector A® has the 
form (see (5.6)) 


“dr (OgA - P3,A ) =0, a, B,Y = 0, ¢. (5.34) 


Furthermore, since we are parallel transporting along a circle where ¢ is changing 
but @ = @ is fixed, the relevant equation to study is 


05A° —TS.,(0 = 00) A” = 0, (5.35) 


which in components (with the use of (5.33)) becomes 


83.A° —1$4(8 = 00) A®* = 0, 
or, 03A° =sin Oo cos Oo A®, (5.36) 


dsA* —1$,(0 = 00)A° =0, 


or, 0¢A% = —cot Oo A’. (5.37) 
The two coupled equations (5.36) and (5.37) lead to 
a3.A® = —cot 0003A° = — cot (sin 80 cos 60A*) 
= — cos? 6).A®, (5.38) 


whose general solution has the form 


A® (00,0) = C sin(¢ cos 00) + D cos(¢ cos 90). (5.39) 


On the other hand, the “initial” condition (5.30) 


A® (00,0) =0, (5.40) 


5.3 PARALLEL TRANSPORT ALONG A CLOSED CURVE 143 


determines D = 0 so that we can write 


A® (00, ¢) = C sin(¢cos 0). (5.41) 


Substituting (5.41) into (5.36) we obtain 


4.A° (80, ¢) = sin Oo cos 00.A° (00,6) = C sin Oo cos Oo sin(¢ cos Oo), 
or, A°(00,¢) =—C sin 00 cos(¢ cos 90). (5.42) 


Imposing the “initial” condition (5.30) 


A® (40,0) = 1, (5.43) 
determines 
C =-— csc, (5.44) 


so that we can write the solutions to be 


A®(00,¢) = cos(dcos Oo), A®(00,¢) = — csc Oo sin(¢ cos 0). (5.45) 


We note that the length of the “initial” vector is given by 


900A° (80,0)A° (80,0) + 964A° (80,0)A* (80,0) = 1. (5.46) 


The length of the parallel transported vector at any subsequent point on the circle 
is given by 


900° (0, )A° (80, 6) + 965A® (80, 0)A® (80, @) 
= cos” (¢ cos 00) + sin” Oo csc? 09 sin”(d cos Oo) = 1, (5.47) 


which shows that the length of the vector remains invariant under parallel trans- 
port. However, the vector gets rotated (the components change) as we move 
along the circle. In particular when we return to the starting point, namely, 
when ¢ = 27, the vector has the components 


A® = (cos(27 cos 00), — csc Oo sin(27 cos A0)) , (5.48) 


which is different from the “initial” vector (5.30) unless 6) = 0, or 5, namely, at 
the pole or at the equator . 


Method 2: We have already determined the formula for parallel transport by an 
infinitesimal amount in spherical coordinates in (4.63). In the present problem, 
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however, we have dr = d@ = 0,r = 1,A" = 0 so that the change in the vector 
under an infinitesimal parallel transport along the circle 0 = is given by 


dA® =18.,(00)d@A® = sin Op cos 00. A° dg, 
dA® =T$,,(00)d@A® = — cot OA° dd. (5.49) 


Let us take the total angle traversed along the circle at 0 = 0) to be ¢ and 
divide it into N equal infinitesimal parts so that 


Ndd = 4, (5.50) 


where N is assumed to be large and we will take the limit N — oo,d¢@ > 0 
such that (5.50) holds. Namely, we have divided the parallel transport by a 
finite amount into a series of successive infinitesimal parallel transports of equal 
amount. From (5.50) we recognize that at every order of the parallel transport, 
the components of the transported vector will be related as (n < N) 


Abas _ 0 sin 09 cos Ood@ Ae 
(e) = E = € cot Ood¢ 0 Ag) ° (5.51) 


We can write the components of the vector after N successive infinitesimal parallel 
transports compactly as 


N 
AS = [[G+4,)45, (5.52) 


n=1 


where Ag denotes the “initial” vector (5.30) (in column form) and we have iden- 
tified 


Reve ( 0 sin 49 cos a) _ cos Io ( 0 nu) 


— cot Ood¢ 0 N — csc 09 0 
A 
Sy (5.53) 
where we have defined, for later use, 
= 0 sin 00 
A = ¢cos 0 € éseels 0 ’ : (5.54) 
Let us recall here the important identity that 
N 
: A A 
Nim, 1G + W? =e," (5.55) 


so that in the limit N — oo the transformation (5.52) takes the form 
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A (6) =e Ag. (5.56) 
Since the matrix in (5.54) squares to A? = —(¢cos60)?1 we have the general 
identities 

A?” = (-1)"(dc0s09)?"1, A?"*1 = (-1)"(dcos 0)” A, (5.57) 


which can be used to simplify the exponent in (5.56) 


e* = cos(¢cos 6) 1 + sin(¢ cos 09) A. (5.58) 


1 
cos 0 


Using (5.54) as well as (5.58) in (5.56), we finally have 


A® (60,9) _ cos(¢ cos 90) sin 0 sin(@cos60)\ (1 
(aeter, o)) (esa — esc 09 sin(¢ cos 60) cos(# cos 00) ) @ 
_ cos(@ cos 80) 
o — csc Oo sin( cos ai) : (5.59) 


which coincides with (5.45). 
< 


To calculate the change in a vector (in general) when paral- 
lel transported around a closed curve, we note that any finite path 
can be decomposed into many infinitesimal closed loops. And hence 
the problem of studying the change in a vector parallel transported 
around a finite loop is equivalent to studying the change in going 
around an infinitesimal loop as shown in Fig. 5.4. If we use the rule 
for parallel displacement (see (4.12)) 


de = ae 
Gee. cee da og 
or, aaa = vd ae id ; (5.60) 


then, the change in the vector in going around a closed infinitesimal 
loop can be written as 


Agt = face = gar < = far Tesi) — r(x). (5.61) 
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(79) 


Figure 5.4: Parallel transport of a vector along an infinitesimal closed 
path. 


Here we are assuming that the curve is parameterized by the proper 
time 7. Furthermore, the evaluation of the integral can be simplified 
by expanding quantities inside the integral around a fixed point on 
the curve, say, (7) = 2p. Since the loop is infinitesimal, we have 


I (2) =I, (xo) + (a — 20)?0,T%) (a0) + O ((a — 20)”), (5-62) 


and similarly, 


In 
‘— 
— 
co 
I 


EM (x) + 6€" (x0) 
= & (a9) +T%, (20) (x — ao)”€>(ax9). (5.63) 


Using these relations and keeping terms only up to linear power in 
(x—2o), the expression for the change in the vector in (5.61) becomes 


Det = fdr [Ty (00) + (2 — 20)" BP (00) + 


x [Mw0) + PA — a0)P€°(a0)]} 


V 


ae t (aw — 20)"€? (0) { IpE 40) 


da” 
+ PE (00)T 2p (0) } S| 


I 
Q 
= 

rae | 

a 

Se 
»- 
— 

8 

oO 
a 
Pan 
»- 
— 
8 
2 
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d 


gy 
dt 


= Pi (a0)é* 20) # ar 
+£° (x0) { IpU_ (#0) + 1%, (wo)P,(z0) } 


dx” 
Spee 
< gare x0) ca 
(5.64) 


Let us note that for a closed curve 


gar 


since the coordinate comes back to the same point. The expression 
for the change in the vector in (5.64), therefore, simplifies to 


x 


d 
d 


= f cn (5.65) 


T 


da” 
AgH = £7 (9) { pF Mo(t0) +12, (20) 25 (0) } f dr ah 
(5.66) 
Furthermore, let us note that 
da” d dx? 
Pe Ge pe | ne a ee 
gare ae gar [= re" «| 
y ae? 


Namely, the integral is anti-symmetric in the indices p and v so that 
we can manifestly antisymmetrize the coefficient of the integral in 
the indices p, v and write 


1 
As 7 9 §° (x0) {9,0 (xo) = OTS, (xo) 

d Vy 

HY, (to )Ppo-(#0) = P4 (2o)P 24 (a0)$ gar xP = 


£° (0) Riyyplta) far” (5.68) 


T 


1 
2 
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where R” ovp is the Riemann-Christoffel curvature tensor defined ear- 
lier in (5.18). 

The remaining integral is easily recognized as the area of the 
infinitesimal loop. This can be seen simply by recalling that in three 
dimensional Euclidean space the area enclosed by a curve is given by 


A = -5 fp de xr, 
1 


or, A; = —5 f cinder. (5.69) 
Thus, we note from (5.68) that the change in a vector parallel trans- 
ported along a closed infinitesimal curve at any point is proportional 
to the curvature tensor at that point as well as the area enclosed by 
the curve. Thus we see that the vector would not change when par- 
allel transported along an infinitesimal closed curve around a point 
only if the curvature tensor vanishes at that point. Furthermore since 
a finite closed curve can be thought of as a sum of many infinitesimal 
closed curves, when parallel transported along a finite closed curve, 
a vector would not change only if the curvature tensor vanishes iden- 
tically at every point in that region. 

If the curvature vanishes identically in any finite region of space- 
time, then, of course, any vector can be parallel transported along 
any closed curve in that region without any change. This simply 
means that in that region parallel transporting a vector from point 
A to B (see Fig. 5.3) is independent of the path along which the 
vector is parallel transported. Thus in such a case, given a vector at 
a point, namely, €“(a9), we can obtain its value at any other point 
€"(xz) in the region uniquely simply because parallel transport does 
not depend on the path. Furthermore, in such a case if we choose a 
curve x“(r), then along this curve 


dg* — da” Osh 
‘dr dr da” e) 
On the other hand, the law of parallel transport gives 
bb V 
ae ae (5.71) 


dr vr» dr 
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so that combining the two relations we obtain 


dx” da’ 
Me TH ww eX 
dr Ong Ur dr 
ae. SE (a gH pH é) a8 6. (5.72) 
, dr ¥ A arr =” 


Since this is true for any x“(7) we conclude that for this to be true 
the covariant derivative of the vector must vanish in that region, 
namely, 


Dé" =0. (5.73) 


Conversely, if we can find a vector field whose covariant deriva- 
tive vanishes in some region of space-time, the curvature must also 
vanish in that region. This follows because if 


Dé" =0, (5.74) 


then the vector can be parallel transported along an arbitrary closed 
curve without any change. This implies 


1 da’ 
b> Ph eo p = 
AE 5 Tipps gar x = 0, 
or, cir a = 0, 


or, = RY, = 0. (5.75) 
This can also be obtained from the fact that if the covariant derivative 
of a vector vanishes, then so will the commutator of two covariant 
derivatives and, therefore, the curvature. 


5.4 Geodesic equation 


There are several ways to derive the geodesic equations in a curved 
manifold. Let us first derive it from the requirement of general co- 
variance. 
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Let us ask how we can generalize the equations of motion for a 
particle in flat space to a curved manifold (in the presence of gravi- 
tation). The simplest equation to consider is, of course, the equation 
of motion of a free particle. In the locally flat Cartesian coordinate 
system we know that the equation is given by 


dct 
dr2 0; (5.76) 


where dr? = Nv dz"dx” is the proper time. If we now go to a general 
coordinate frame defined by 

x! (7) > a'#(x(r7)), (5.77) 
then 


dx da". da da 


ae ae ee ae ee) 


where it is understood that ie is evaluated at x(r). It now follows 


that 
dx’ 
(x) 
Ox! da’ 
Ox” drt 
Ox! d2x’ 2x4 da da’ 


~ Oa" “dre at OxOx” dr dr’ (e79) 


d?xt 


d 
ap de 
a 
dr 


In other words, the naive generalization of the free particle equa- 
tion (5.76) to a curved space does not transform covariantly under a 
general coordinate transformation and, consequently, it cannot rep- 
resent the true equation of motion if general covariance is to hold. 
On the other hand, we know that the affine connection changes under 
a general coordinate transformation as (see (4.22)) 
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He I 
Tw = ae 


Ox! Ox’! Ox! Or! Ax”! Ax™ 


A 3 M1 
= Or Axl” Ax!» M1 Or"1ar™ Ox Oa!r’ (5.80) 
so that 
pe, dav dat py da¥ da 
ur dr dr vr dr dr 
as poe? One. Oa — Arlt Axt Ax 
i= Ort Ox” Ox wiry Or41Ax™! AxlY Or» 
: Ox!” dx” Ox" dx 
Ox¥2 dr Ox dr 
5u MY Ox! mn O27't dr’2 dx 
v2™ 2 Orla 411 Ori or™ ae “aie 
= Ox! m7 62 7't dr”! dx! (5 es 
~ Og A Ontioa | nde "dae 
As a result, we see that the combination 
Pat A dx’ dx ee ii dr” dx 
dr? vr dr drt dr2 vr dr dr 
Ox! [22° dx” dx 
= aoe | aor Tee ae 5.82 
Or? dr2 vr ae sae ’ ( ) 


transforms like a contravariant vector under a general coordinate 
transformation. In a locally flat Cartesian coordinate system, the 
connection vanishes and hence this simply reduces to cat There- 
fore, from considerations of general covariance alone we conclude 
that the equation of motion for a free particle in a gravitational field 
(curved manifold) is given by 


d?xt dx’ da* 
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This equation is known as the geodesic equation and determines 
the trajectory of a free particle in a curved manifold. We recall here 
that the four velocity of a particle is defined as (see (2.5)) 


drt! 
ye (5.84) 
dr 
and behaves like a contravariant vector under a general coordinate 
transformation (see (5.78)). The free particle equation (5.76) can be 
written in flat space in terms of the proper velocity u as 


du 


As we have seen in (5.79), this is not covariant in a curved space and 
we recall that covariantizing this equation simply would correspond 
to replacing a by the appropriate covariant derivative (see (5.5)). 
Thus the covariant equation for a free particle in a curved manifold 
is given by 


Dut 
oe ts) 
Dr j 
du! dx’ 
o, TY, ~ ud =0. (5.86) 


If we now substitute the form of u/ in (5.84) into (5.86), we obtain 
the geodesic equation (5.83) or the equation for the free particle as 


d?rt ade’ da* 
a ara vee 


> Example (Affine parameter). We note that the geodesic equation (5.83) de- 
scribes the equation of motion for a free particle in a curved manifold 


dat p de” da* 
aa —— =0, 5.88 
dr2 uw dr dr ( ) 
where 7 is any parameter labeling the trajectory which we can choose to be the 
proper time as well. 

Let us choose a new parameter 
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8 =s(T), (5.89) 


to parameterize the trajectory. The geodesic equation (5.88), in this new variable, 
takes the form 


d /ds dx’ re ds dx” 
dr \ dr ds ur \ dr ds 
ds? da" (#) d?a" (f) ede aa 


ds dax* 
dr ds 


? 


or, 


dr2 ds dr) ds? \dr/ “ds ds ”’ 
d?xt da” da* as da" 

or, —--%Y——_—=-- aS (5.90) 
ds? ds ds (28) ds 


Thus, the form of the geodesic equation (5.88) will remain invariant in the new 
parameterization if 


or, s(r) =aTr+8, (5.91) 


where a and £ are constants. This is a linear transformation relating the two 
parameters — also known as an affine transformation. Parameters which leave the 
form of the geodesic equation (5.83) or (5.88) invariant are also known as affine 
parameters (related through affine transformations). 

< 


The geodesic equation can alternatively be derived as the straight- 
est path in a curved manifold as follows. From the study of parallel 
transport we have seen that under parallel transport the change in 
a vector is given by (see (4.12)) 


. Xr 
Santa (5.92) 


oe pee 
dr “dr °? 
bb v 
a. pe (5.93) 
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is parallel to itself along a curve 2“(r). We recall from the study 
of Euclidean geometry that a straight line carries the tangent along 
itself. So we can generalize this definition to a curved manifold by 
defining a “straight line” in a curved manifold as one which carries 
its tangent vector parallel to itself along the curve. Defining €" to be 
the tangent vector to the curve we have (this is the proper velocity) 


dx 
Le SS —, 
S drt 


Thus the tangent vector would be carried parallel to itself only if 


(5.94) 


dé n dR 35 
dr Pia dr a 
dat a oder? da* 
or, “d= = ean ae Pre = 0. (5.95) 


This is precisely the geodesic equation (5.83) and we conclude that 
the geodesic curve carries its tangent parallel to itself and hence is 
the straightest curve in a curved manifold. 


B 


Figure 5.5: Motion from A to B along a given trajectory. 


> Example (Solution of the geodesic equation). Let us consider the geodesic equa- 
tion in the plane in polar coordinates where the line element is given by 


dr? = dr? + r7de. (5.96) 


This is the same space as in (4.51) and, therefore, we can carry over the metric 
and the nontrivial components of the connection from (4.52) and (4.55) which 
have the forms 


grr = 1, Joo = r, g =1, Gg =a 


1 
Too =r, [ey =1%, = =3 (5.97) 
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The geodesic equations (5.83) are given by 


d?x" 
a 


In the present case, there are two equations. For w =r, we have 


d?r 
dr? 
dr 
dr? 


or, 


p dx” da* 
vw dr dr 


r dé dé 


d?6 9 dé dr 
are 7 ae ar 
a6 4 200 ar _ 
on dr2. rdrdr | 


Equation (5.100) can be integrated as follows 


a’o  2d0 dr _ 
dr? rdrdr 
pa ee Dede 2 
%  d8 dr? © op dr 
dr 
d 2 dd 
or, = n(x =) =0 
or, r ae = ¢=constant, 
dr 


? 


? 


= 0, 


ae de 


do\” 
-r(3) =9 


while for = 0, eq. (5.98) leads to 


? 


’ 
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(5.98) 


(5.99) 


(5.100) 


(5.101) 


where we recognize £ to correspond to the orbital angular momentum of the 


particle. Note that (5.100) and (5.101) lead to 


d { 5 (de? > dé a6 
z(: (s¢)') =» apap ¢ 


Using (5.102) in (5.99) we obtain 


(5.102) 
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or, 2——> — 2r— 


dr\* 4 fd@\? 
Here the constant of integration is chosen to be compatible with the line element 


(5.96). 
Since r = r(r),6 = 0(7), we can also write r in the parametric form 


dr _ dr dé 
dr d@dr’ 


er dr 2 /dr\’\ (ao? 
o-(2-2(4) (=) lO!) 


where we have used (5.100). Using this in (5.99) we obtain 


r= (0), 


() a) (5.105) 


The geodesic equation (5.105) describes a free particle motion and, there- 
fore, in Euclidean coordinates we expect the general solution to be of the form 


y=axrt+b, (5.106) 


where a, b are constants depending on the initial conditions. In polar coordinates, 
(5.106) can be expressed as 


b 
sin 8 — acos@’ 


r(0) = (5.107) 


where we have used x = rcos0,y = rsin@. Indeed it can be checked easily that 
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dr __—- b(cos@ + asin 8) 
dé —— (sin — acos@)2’ 


dr. b 2b(cos 6 + asin)? 
dé? sin 9 — acos@ (sin 6 — acos 6) 


2 (dar\? 
Stee (S) ; (5.108) 


so that (5.108) gives the general solution of the geodesic equation (5.105). This 
shows that the geodesic describes the straightest path in the manifold. 
< 


The geodesic equation can also be derived as the equation for 
the shortest (extremal) path between two points in a curved manifold 
as follows. Let 7,4, be the (proper) time taken by a particle in going 
from point A to B in a gravitational field (see Fig. 5.5), namely, 


a ‘i ae (5.109) 
A 
where (c = 1) 
ds? =d7? =9,,d2"de". (5.110) 


Let us assume that the curve x" along which the particle moves is 
parameterized by an invariant parameter A. Thus we can write 


1 
M& dx’\2 
dr = @ — da, (5.111) 


so that 


i ; dx! dx”\2 
x ev \2 
A A 


This is like an action and hence we can derive the equation of 
for the shortest (extremal) path simply by extremizing the action. 
Let us change the path infinitesimally, 
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z*(A) > 2 (A) + dx" (A), (5.113) 


subject to the condition 


da" (A) = 6x"(B) = 0, (5.114) 


so that the end points are held fixed during the variation. Further- 
more, let us note that the present action (5.112) has a one parameter 
group of gauge invariance. Namely, under the reparameterization 


\3t=2(0), (5.115) 


we note that 


dx da! _ dx!’ dvr 


ae dk dk a (118) 


so that we have 
i da’ da” 2 
Tips: = fa @ “an = 
A 
B 1 
+ fal #2) 
A 


( dx dd da” ay 
Inv FZ rs 


- fa a: de ak de 
1 
siege dz! dx’ \ 2 
ie a A. 
A 


1 
dx¥ dx” \ 2 
= [o @ 7a. =) = TAB: (5.117) 
A 
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Namely, the action does not change under the reparameterization 
(5.115) of the curve. This is reminiscent of the gauge invariance of 
Maxwell’s theory. Thus we can choose a gauge condition and we 
choose 


N=r. (5.118) 


With this choice of the gauge condition as well as the condition of 
fixed end points in (5.114) we obtain from (5.112) 


B 


iran = f ad ———--—_- 
2 (aw 3 )} 


‘A Fuv “ax “ay 
(Bas 8 8 4.99, 0% M0" 
Bay. <a ee ee 
B 
i dx" dda” 1 ada? “dz” 
— fa & ( “ax. ae “ 3 Ox Guy OX aN oo 
A 


B 

da’ dda” 1 dat da’ 
= d — — 4-0, — —— 6x7}, (5.119 

far(ow SEE Sa Hae), 

A 

where we have used (5.111) and (5.118). Integrating the first term 

by parts and remembering that the end points do not change under 

the variation (5.114), we obtain 


d - d da¥ 
iran = ae Ge | + fara" (sue) 
= A 
1 dh de. 
+5 Poti Ge ae 82" | 


B 
7 dx? dat d?xh 
fa [-6z zz dz ap 7 ou ae 
A 
dat da” 50 
dr dr 


1 
+ 5 OoGuv 
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B 

V LL 2 Aft 

= = f ar 50° ath ayn 
A 


dr drt “dr? 
_ 1 5 dx! da’ 
2 a dr dr 


B 
is d?at 
- far 6x | “a. 
A 
1 dx! da” 
ig 5 (On9u0 TF Ongve od OxGnis) ae om . (5.120) 


If the action has to be stationary along the actual path (geodesic) of 
the particle then, we must have 


5t45 = 0, (5.121) 


and since the variation 6° is arbitrary this is possible only if the 
integrand in (5.120) vanishes identically. Namely, the action will be 
stationary only along the trajectory given by the equation 


d2z# 1 dat! dx’ 
uo ee + 3 (O96: Op Gea = Oc) = aay Vou 


dan’ 1 Ao dzt dx’ 
ae dr? = 29 (QnGve + Gon — Oo Ip) ‘dr dr ; 
d?x dx! dx’ 
So Bree = 0. 122 
ae a Dia a Pa (5.122) 


This is, of course, the geodesic equation (5.83) which shows that the 
trajectory of a freely falling particle (under the influence of gravi- 
tation) is given by its geodesic which is the shortest (longest) path 
between two points. Furthermore, note that although the equation 
seems to be a vector equation, the particle in reality has only three 
independent degrees of freedom. The other degree of freedom is 
expressed in terms of the independent degrees through the gauge 
condition (5.118) 


N=7, (5.123) 
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which leads to (see (5.111)) 


dr | dzt da’ 2 
ae ee a a 


Or: euler 1, (5.124) 
We have already seen such a condition in the case of the Lorentz 
covariant description of particle motion (see (2.8)) and we conclude 


that the particle truly has only three independent degrees of freedom 
as it should. 


5.5 Derivation of geodesic equation from a Lagrangian 
The discussion of the last section suggests that the motion of a free 
particle in a curved manifold can be given a Lagrangian description 


much like in flat space. As we have seen in (5.112), the action for 
the free particle can be written as 


B 1 
dg’ da’ \ 3 
TaB= fa @ a =) 5 (52125) 
A 


where (see also (5.111)) 


1 
dr da! dx’ \2 1 
—<— | —_— —— = pl AV’ > 
an @ ar x) (GightVer an)? (5.126) 


where a dot denotes a derivative with respect to the parameter 4. 
Thus we can think of the Lagrangian for the system to be 


da! da’ \2 ere: 
= (sole) Te Ge) = omlenitaryt, (5.127) 


which is a function of the coordinate x“(A) and the velocity «“(A) 
with 
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OL 1 1 


SF Oy ” = —  — Qur%’, 
us 2 (aaa)? (gagtt8)? 
OL dy dx” da” (5.128) 
8 Bee de SHY ay SHY ar 
where we have used (5.111). 
Similarly, we also have 
OL 1 roe 1 di da? da” 


ee ae 


= 5 Gov — (5.129) 


ret AN de ge Sone 

: dr d dz’ = 1 dr 9 dx? da’ a 

SP Nae det Da ods deo 

: dr d2x” mn dx? dx’ = 1 dx? dx’ -_ 
OE ay [dee de dp 2 Har dr | 


dev 1 dx? da’ 
or, Inv 4 + 5 (Oo Suv Te Ov Gyo = OnJov > ae 


=0, (5.130) 


Multiplying with the inverse metric, this can be written as 


des 1, det “dz” 
gh = ink: = 
dr? o 59 (Oo guv + OvGpo — OnGov) dr dr 
Fea dz? dx” 
oo x d x x = () (5.131) 


ie ae ae 
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where we have used the identification (4.49) for the Christoffel sym- 
bol. This is, of course, the geodesic equation (5.83) that we have 
derived earlier from various points of view. As we will see in the 
next chapter, the Lagrangian description gives a simpler method for 
determining the components of the connection. 


> Example (Alternative action for the geodesic). For a massive particle, the ac- 
tion that leads to the geodesic equations can be written as 


72 1 
da" dx” \ ? 
TL 


which coincides with (5.112) except for the multiplicative factor m (mass of the 
particle) which makes the action dimensionless. The Euler-Lagrange equation 
following from this gives 


2 y Xr 
Gee pe ee ag, (5.133) 
dr? dr dr 
which coincides with the geodesic equation. There are several things to note here. 
First of all the parameter 7 can be any affine parameter labelling the trajectory 
and not necessarily the proper time unless a specific gauge choice is used (see 
(5.118)). Furthermore, the geodesic equation is the same for any massive particle 
(the mass drops out in the equation) reflecting the fact that gravitation acts the 
same way on all particles. 

However, there are certain disadvantages in using the action in the form 

(5.132). For example, the action is meaningful only for time-like trajectories 
(because of the square root) for which we have 


da” da” 
Suv dr dr 


Furthermore, since mass is an overall multiplicative parameter in the action 
(5.132), it is not a good action for massless particles for which it identically 
vanishes. For these reasons we look for an alternative action which would lead 
to the geodesic equations as Euler-Lagrange equations and would also overcome 
the disadvantages in (5.132). Normally this is done through the introduction of 
auxiliary (non-dynamical) variables (fields). 

Let us consider the action 


> 0. (5.134) 


Ti T2 
‘ 1 1 dx" da” ; 
TL 


T1 
where F is an auxiliary (non-dynamical) variable. We note that the action (5.135) 


is defined for arbitrary m including m = 0. Similarly, since there is no square 
root, the action is defined as well for 
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dx" dx” 
a oe <0, (5.136) 


in addition to time like trajectories. 


Since F' is an auxiliary variable (there is no af term in the Lagrangian), 
T 


the Euler-Lagrange equation for F’ is quite simple and has the form 


OL 
or” 
i oa 1 da” da” 1 a, 
> Bps72 Sea de T QF ~™ 
1 da” dx” 
or, F= me 24" Gr dr’ (5.137) 


This is a constraint equation (not a dynamical equation) and if this is used to 
eliminate F in the action (5.136) we recover the action (5.132) 


ge 1 | m da’ da” dat dx’ 
= dx» dae a dr dr ae dr dr 
pa ae 
if dar d 
wu v 
zs m f ar Su =§. (5.138) 
T1 


Therefore, we see that the alternative action (5.135) is equivalent to the action 
(5.132) if the constraint involving the auxiliary variable F is used. Therefore, the 
action (5.136) would also lead to the geodesic equation which can be explicitly 
verified from the Euler-Lagrange equation for x“ together with (5.138). However, 
as we have emphasized this alternative action does not suffer from the criticisms 
which apply to (5.132). 

The variable F' can be identified with the induced one-dimensional metric, 
grr, on the trajectory. This alternative formulation of the action is particularly 
useful in theories like string theory. 

< 


> Example (Geodesic equation in polar coordinates). The action for a particle 
moving on a plane in polar coordinates is given by 


oS aw = = fa (7? + 776"), (5.139) 


where dots denote derivatives with respect to t. The Euler-Lagrange equations 
lead to the dynamical equations 
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a 0, 
r°6 + 2r76 = 0, 
By ee 
or, 0+ pee =0. (5.140) 


Let us note that the geodesic equation (5.83) describing the motion of the particle 
for the present problem can be written as (y,v,A = r, 6) 


d?x" pe da” da* _ 
dr2 dr dr’ 
dr\? (d?a" da” dx* 
al of oe SN as 
a (St) ( dr? dr =) 6 
ot, 2 Ie" a" = 0. (5.141) 


Comparing (5.140) with (5.141) leads immediately to the nontrivial components 
of the connection 
V0 =T, 


i, =Te= -<. (5.142) 


These coincide with our earlier results derived in (4.55). 


This page intentionally left blank 


CHAPTER 6 


Applications of the geodesic equation 


6.1 Geodesic as representing gravitational effect 


We have seen that the geodesic equation leads to the extremal path 
in a curved manifold. However, it is not clear if it incorporates the 
true effect of gravitational force on a particle. More specifically, we 
may ask how we can choose the equation 


d? xt dx” da 
are =4 5) ae ae (6.1) 


over, say, the equation 


d? xt ae da” a é da’ dx 
— pee ies SS 2 
dr2 vr» dr dr OR xg? dr dr Os ez) 


as representing the motion of a free particle in a curved manifold. 
Here a is a constant, R" yy, denotes the Riemann-Christoffel curva- 
ture tensor defined in (5.18) and S$? is the spin vector of the parti- 
cle. General covariance arguments cannot distinguish between the 
two since both the equations transform covariantly like contravari- 
ant vectors under a general coordinate transformation. Furthermore, 
the principle of equivalence cannot distinguish between the two either 
since in a locally flat Cartesian coordinate system 


PaO, es 0. (6.3) 


Therefore, both the equations (6.1) and (6.2) reduce to the free par- 
ticle equation in a Lorentz (inertial) frame, 


d2at 
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The answer to this interesting question comes from the observa- 
tion that whichever equation represents the true equation of motion 
must reduce to Newton’s equation of motion in the nonrelativistic 
limit. Let us consider a particle falling freely under the influence 
of a weak, stationary gravitational field produced by a point mass 
M. Classically we know that the gravitational potential at a point 
x (with the mass M at the origin) can be written as 


aes (6.5) 


where Gy denotes Newton’s constant. Newton’s equation of motion 
for a particle in this potential has the form 


d?x 


qe = 7 Ve): (6.6) 


If we assume that the geodesic equation represents the true equation 
of motion incorporating the effects of gravitation, then we have 


dat i ee de 
Gr ka ae — 


and it should reduce to (6.6) in the appropriate limit. Furthermore, if 
we assume the gravitational field to be weak, the metric would change 
only slightly from the Minkowski metric and we can decompose the 
metric as 


Juv (£) = Nv + Pw (2), (6.8) 


where h,,,(x) is assumed to be small in magnitude. Consequently we 
can write the inverse metric as 


g(a) = fh — Be, (6.9) 


so that 
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I? 


gl” (x) gur(x) (n™ — AMY) (nx + hex) 


I 


a ix — Wr a + if Niwa 
a? 0h (6.10) 


We note that the indices of h,,,h’“” can be effectively raised (or 
lowered) with the Minkowski metric (since h,, is assumed to be a 
weak field). 

Since the gravitational field is stationary, the metric must be 
independent of time and we have 


OoIuv = Ohyw = 0. (6.11) 


Furthermore, if the gravitational field is weak and the particle is 
nonrelativistic, then we note that we can neglect terms of the type de 
in (6.7) in comparison to ae = +. (This is equivalent to assuming 
|v| < 1 with c=1.) With these assumptions the geodesic equation 


(6.7) in this limit reduces to 


Qi cig. es dt 
ate is es 6.12 
dr? dr dr re) 
Let us also recall the definition of the connections (see (4.49)) 
lu 1 up 
a = 5 g (OL9Xp =F OrIpv im OpnGur), (6.13) 
from which it follows that 
lu 1 up 1 up 
Too = fe (0090p + A09p0 — A900) = ao OpGo0; (6.14) 


where we have used the static nature of the metric tensor (6.11). 
Keeping only terms linear in the field h,,, we have 


1 
Moo = 5 1” Aphoo, (6.15) 
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so that 
0 ; hs 


Substituting these into (6.12), we see that the geodesic equation 
now reduces to 


dt 

dr2 i 

dx 1 dt \? 

paerion Bayes — — 0. “Al 
qe + 9 Vhoo(x) (+) 0 (6 7) 


The first equation simply says that ft is a constant. Therefore, we 


bs : 2 ; 
can divide the second equation by ( at) to obtain 


d?x 1 

— =-= WA , 6.18 
qe 5 Whoo (x) (6.18) 
This has the right form as the classical Newtonian equation (6.6) and 
hence we conclude that the geodesic equation (5.83) represents the 
true gravitational effects. Furthermore comparing with the classical 
equation (6.6) we see that in the weakfield limit we can identify 


2GnM 
|x| 


hoo(x) = 2¢(x) + constant = — + constant. (6.19) 


The constant can be determined by imposing the asymptotic condi- 
tion that infinitely far away from the gravitational source (the mass 
M) the metric must be Lorentzian (Minkowski) and hence 


lim, «Agee —=¥ 0, (6.20) 


|x|—-00 


which determines the value of the constant to be zero. Consequently, 
we determine the metric to be diagonal with 
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2GnM 
hoo(x) = 2¢(x) =-——, 
|x| 
2GnM 
or, goo(x) = oo +hoo(x) = 1+ 2¢(x) = 1—- ra ; 
9j(X) = My goi(x) = 0. (6.21) 


In this case, far away from the source the invariant line element 
reduces to 


dr? = Gwdetds™, 
dr\? a dx! dx’ 
On ae Su “ae dt 
da’ dx 


ix 
drt : 2GM \2 
Go wr) 
Bye ite x\7 GM 
a. aes 
= 1-(5 w*- 009) 
- 1-(r-Vv), (6.22) 


where we have used |x| < 1 and |x| >> 1. (T and V denote respec- 
tively the non-relativistic kinetic energy and the potential energy of 
the particle scaled by the mass of the particle.) Therefore, the action 
for the system can be written as (see (5.112)) 
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= <¢onst - fat (T —V) 


= const - [a L, (6.23) 


which is familiar from classical mechanics. This leads to the dynam- 
ical equation from 


-—6T = 5 fat L=0, (6.24) 
which we recognize as the minimum action principle. 


6.2 Rotating coordinate system and the Coriolis force 


The geodesic equation holds not only in a gravitational field but 
also in any curvilinear coordinate system. Therefore, we can again 
test the validity of the geodesic equation by expressing it in a uni- 
formly rotating coordinate system in flat space and comparing with 
the classical expressions for the Coriolis and the centrifugal forces. 

Let us recall that the earth provides an example of a uniformly 
rotating coordinate system. If we assume that the earth is rotating 
around the z-axis with a constant angular frequency w, then this 
system has cylindrical symmetry as shown in Fig. 6.1. Therefore, 
we can write the invariant length in the inertial frame or the space 
fixed frame in cylindrical coordinates as 


dr? = dt? — (dr? + r2dd? + dz’). (6.25) 


We can go to the uniformly rotating coordinate system from this 
through the transformation 
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Figure 6.1: Rotating earth as a natural coordinate system with cylin- 
drical symmetry. 


t — f, rT, 
Zo B, o> o+ut. (6.26) 


In the rotating coordinate frame the invariant length (6.25) takes the 
form 


dr? = dt? — (dr? + r?(d(¢ + wt))? + dz?) 
= dt? — (dr? + r7d¢? + r2wdt? + 2r2wdtd¢ + dz?) 
= (1—r’w*)dt? — 2r?wdtdd — (dr? + r?2d¢?+dz7). (6.27) 


Therefore, we can simply read off the components of the metric tensor 
in this frame as 


Gt = (1 = ru), Grr = —I, 
I¢¢ = —r, 9zz = —1, 
96 = Gor = —T7w. (6.28) 


It is worth noting here that the components of the metric tensor 
depend at best on the radial coordinate r. This also shows that if 
we restrict ourselves to time-like motion 
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gt = (1—7?w?) > 0, (6.29) 


we must have rw < 1 and accordingly we assume that the frequency 
of rotation is small. 

We note that we can invert the metric tensor in (6.28) and 
calculate all the components of the connection. However, (as we have 
already mentioned) a simpler way of determining the connection is 
to look at the dynamical equation from the Lagrangian (see (5.130)) 
and identify the coefficients of the double derivative terms, namely, 
terms of the kind a a with the components of the connection (see 
(5.83)). To do this we write the geodesic equation in the alternate 


form (5.130) (by multiplying with the metric tensor) as 


deg’. da? dx” 
Gv are + 3 (O59u» + Ov Guo oom OnJve) as = —0. (6.30) 
The r equation (4 = 7) has the form (a dot here denotes a derivative 
with respect to the proper time T) 


dr 1 dx? dx’ 
Orr ape + 5 (OsGrn + Ov Gro = Ove) ae az = 0, 
ad ery 
or, —T— 5 Orguek’ &” = 0, 
or, e+ 3 (oueil + Ogg PP + 2,46) = 0, 
1 : , 

or, r+ 3 (—2rw? — arg? — 4ruid) = 0, 
or, *—rw*t? — rd? — 2rwid = 0. (6.31) 


Therefore, comparing with the form of the geodesic equation (5.83) 
for uw =r, we conclude that 


Ty, =r’, Vo6 =P; Vig = Gt = Tw. (6.32) 


Similarly, the z-equation (4 = z) is given by 
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d2z 1 da? da” 
Gazz Py + 3 (Oo9ev + OvgGzo — O29vc) es ae 
or, A 0, 
ae. sa: (6.33) 


so that we have 


Tz, =0. (6.34) 


The ¢ equation (y= ¢) is similarly given by 


do d2t 1 dx? dx’ 
Gos-qra + Ist qa + 9 (QoGev + A960 — Vs9ve) FOG =O 


or, —r26 — rewt + OoGovt° £” = 0, 


or, rotrrwt — Gear — Orggtrt = 0, 


or, 2b + r?wt + 2rd + 2rwrt = 0. (6.35) 


Equation (6.35) can also be written as 


d ; ; 

ne (726+ r?wt) = 0, (6.36) 
which shows that r2(¢-+w4) is a constant of motion for the problem 
under study. Finally, the t-equation (u = t) has the form 


Mt - + Ite a + 5 (Oogw + WGte — W89ve) — < = 0, 
or, (1 —r2w?)t — r?wd + Oo gy t7%” = 0, 
or, (1 —r2w?)t — r?wd + Op gyrt + On grsr =: 
or, (1—r’w*)t — r2wd — 2rwrt — 2rwrd = 0. (6.37) 


Equation (6.37) can also be written as 
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¢ (¢ — r?w(d + wt)) = 0, (6.38) 


dr 


which shows that i — r?w(¢ + w#) is a conserved quantity. In fact, 
from these two conserved quantities, we conclude that ¢ must be 
conserved. 

That ¢ is conserved can also be seen directly as follows. Multi- 
plying (6.35) by w and adding to (6.37) we obtain 


(1 — r2w?)t — r?wd — 2rwrt — 2rwrd + r2wd + 1r?2wt 
+2rwrd + 2rw?rt = 0, (6.39) 
which simplifies to give 


t=0, or, ¢ = constant. (6.40) 


This determines 


LO. (6.41) 


Similarly substituting (6.40) into (6.35), we obtain 
r¢ + arid + 2rwrt = 0, 
te Oe SO as 
or, o+-réd+- wrt =0, (6.42) 
r r 


which determines 


ae ae o_o _ 
yy = 0G, = re, =r? =- (6.43) 


This completes the determination of all the components of the con- 
nection. Namely, the nontrivial components are given by 
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Ty, = rw, Pog =T; Vie = Ube = 7, 
1 Ww 
Py = TG, =, Tm = Ty = ——. (6.44) 


To see the centrifugal and the Coriolis forces, let us note from 
(6.40) that 
. dt 

t= a constant. (6.45) 


If we choose the scale of time suitably then the constant can be taken 
to be unity in which case we can identify 


=a (6.46) 


(Another way to say this is to note that we can divide the dynamical 
equations by factors of a) Furthermore, in the rotating frame, 


o=0, (6.47) 


so that the r-equation (6.31) becomes (remember ¢ = 1) 


a 0, 
Of).. =e Sra", (6.48) 
which gives the familiar centrifugal force. 


Using (6.47), the Coriolis force is obtained from the ¢-equation 
(6.42) and reads as (recall ¢ = 1) 


ro + arid + 2rwrt = 0, 


dé dr 
2 
or, r qe tere a =o 
2 
or, 7 GY + Qu cee 0. (6.49) 
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We recognize the second term as the classical Coriolis force. (We note 
here that in a non-uniformly rotating frame, the force generalizes to 
what is known as the Euler force.) These examples give us further 
confidence that the geodesic equation is the true equation of motion 
for a free particle in a curved manifold as well as in a curvilinear 
system of coordinates. 


> Example (Calculation of connection from action). The calculation of the com- 
ponents of the connection from the definition (4.49) is, in general, quite tedious 
and as we have mentioned earlier, the calculation can be simplified enormously 
by considering the action for the geodesic equation. The action for the geodesic 
equation has already been described in (5.112) (see also (5.132)) and an alterna- 
tive form is given in (5.135). For purposes of calculation, however, there is still 
a simpler form of the action for a massive particle (we do not write the overall 
multiplicative mass term) given by 


TB 1 TB 
S= [ar 5 gmat” = fu a8 (6.50) 
TA TA 


where a dot denotes a derivative with respect to 7. The Euler-Lagrange equation 
for this action leads to 


d OL OL 


dr daze Ore’ 


Vv 


d we tate 
or, a (gut )- 5 (Ongov)é z =0, 


oD *=OsV 1 si 2 
or, guvd” + (Dogan i°&” — 5(Ougon)é"&” = 0, 


1 aay 
or, Quvit” le 2 (Oo Gur + Ov Guo a Ongov) Ce = 0, 


1 
or, xh + so (Oc9dv + OvGor aa OnGov) veg” _ 0, 
or, ¢*—TS ¢°¢” =0, (6.51) 


which we recognize to be the geodesic equation (5.83). The action (6.50), there- 
fore, also leads to the Euler-Lagrange equation and is much simpler to manipulate. 
Let us next consider a space described by the line element 


dr? =e?” dt? — @? dr? — r? (d6? + sin? 0d¢”) . (6.52) 


The nonzero components of the metric tensor are obtained from (6.52) to be 


ge =O", grr =—€™, gag =—1?, gee = —1’ sin? O. (6.53) 
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Consequently, the action for the geodesic (6.50) can be written as 


TB TB 
S= part = fa (eons = PMG? — 926? — 7? gin? 06°) , (6.54) 
TA TA 


where a dot denotes a derivative with respect to 7. The Euler-Lagrange equations 


aL aL 
dr Ort Oat 


following from (6.54) (for w = t,r,6, 0%, respectively) are given by 


=0, (6.55) 


ene 267" ay! (r) rt =0, 
— PAM CPO al (ri? _ er) \ (r)F +r6° +rsin’ 0¢" =0, 
5 Ss BS ss 
—r°0—2rrd+r° sind cosd¢* = 0, 
—r* sin? 06 — 2r sin? 07¢ — 2r? sin 0 cos 066 = 0, (6.56) 
where a prime denotes a derivative with respect to the radial coordinate r. 
We can simplify and rewrite the equations in (6.56) also as 
i+ 20! (r)Fé = 0, 
ee 28 )-200) a! (i? es dN (r)F? — pen 2A) 92 — pe-2 MM) gin? od? = 0, 


6+ 276 - sin 0 cos 067 = 0, 


6 +224 + 2cot 066 = 0. (6.57) 


Comparing these with the geodesic equations (5.83) or (6.51), we obtain the 
nonzero components of the connection to be 


Tre = Tir = —¥'(r), Se ene); 
Feat), Tyee 
. 1 
I’. = +2 fa) 2X(r) Tr? r? rT? r? om 
o¢ r sin é€ 3 r@ Or rp or r’ 
1 
te 5 sin 20, 3, =P%, = — cot 0. (6.58) 


This shows that it is a lot easier to derive the components of the connection from 


the action for the geodesic. 
< 
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6.3 Gravitational red shift 


In flat space we are all familiar with the phenomenon of Doppler 
shift. Let us briefly recapitulate here the principle behind this before 
discussing the phenomenon of gravitational red shift. Let us consider 
the following one dimensional problem. A source S which emits a 
light wave of frequency vs moves with a velocity v with respect to 
an observer O. (For v > 0, the source moves away from the observer 
while for v < 0, it moves towards the observer.) If we denote by dt, 
and dt, the time intervals (between two events) as seen in the source 
and the observer frames respectively, then they are related as (c = 1) 


dto = y(v)dts, (6.59) 
where (basically if the observer is at rest and the source is moving 
with a velocity v, then dr = dtp = y(v)dts) 

Q\—4+ 
y(v) = (L—-v*) "2. (6.60) 


If At, denotes the time interval between two successive pulses 
in the frame of the source, then during this interval the source would 
have moved a distance 


Ats v. (6.61) 


Consequently, the time interval (distance) between successive crests 
of a light wave in the source frame is given by 


ie = Ats + Ats v= iL + v)Ats. (6.62) 


Using (6.59), we conclude that the time interval seen in the frame of 
the observer is given by 


Ato = Y(v)Ats =7(v)\(1+v)Ats, 
Ato 


ve (6.63) 


II 
~— 
“——~ 

fond 
eal 
a. 
an 
~ 
ford 
—S" 


or, 
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Consequently, if v; and vo denote respectively the frequencies of the 
photon as observed in the source and the observer frames, then from 
(6.63) we obtain 


ae Fe waite _ (i+e)\} 
rm )(1+ v) Geae (=) : (6.64) 


For small v, this can be approximated by 


Ee hiy, “Sap ty, (6.65) 
Vo As 


which shows (see also (6.64)) that if v > 0, 


Vs > Vo; Xs < Ne: (6.66) 


Namely, if the source is moving away from the observer, then the 
frequency of the light wave measured by the observer will be red 
shifted. 

On the other hand, if v < 0, namely, if the source is moving 
towards the observer, then it follows from (6.64) or (6.65) that 


Vs < Vo; Xs = As: (6.67) 


Namely, in this case, the frequency of the light wave measured by the 
observer will be blue shifted. This phenomenon is known as Doppler 
shift. A similar phenomenon is also observed in the presence of a 
gravitational field resulting in a red shift in the light wave (spectral 
lines) emitted by stars. 

Let us assume that the gravitational field we are interested in is 
produced by a static stellar body (star) of mass M and is weak. For 
such a field we have already determined the metric to be diagonal 
with (see (6.21)) 


goo(x) = 1+ 26(x) = 1 - —— Gig = Nij- (6.68) 
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Let us suppose that a photon (light wave) is emitted by the star and 
is moving away in this gravitational field as shown in Fig. 6.2. Let 
us assume that at point A, the light wave (photon) has a frequency 
v, and that when it falls to the point B the frequency has changed 
to vg. At point A, the total energy of the photon is 


Ey, = hv, (6.69) 


star 


Figure 6.2: Light emitted by a distant star passing through two 
points A and B. 


where we are assuming that the gravitational potential is zero at this 
point (reference point). Furthermore, we know that for purposes of 
gravitational interactions, a photon of frequency v can be thought 
of as a particle of mass m = hy (recall c = 1). Thus like any other 
particle, when the photon falls in the gravitational field it would gain 
a potential energy equivalent to m|A@|. Therefore, we conclude that 
the total energy of the photon when it reaches the point B would be 


Energy conservation leads to 
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Fry = Es, 


or, hvy = hvg+mlAg| = hvg + hvgl|Agl, 


1 tL GyM 
Or, VB —— es On MAL ~1l- a Ar. (6.71) 
Py 1+|Adl 1+ au Ar r 


Here Ar is the radial separation between the two points and r is 
the radial distance of point B from the star. This shows that the 
frequency of the photon decreases as it falls in a gravitational field. 
We can calculate the ratio of the wavelengths from the relation (re- 
member c = 1) 


\v = 1, (6.72) 


so that we have 


M 
Ar. (6.73) 


In other words, the wavelength of the light wave received at B would 
be longer than that at A. That is, there would be a red shift in the 
photon frequency and wavelength. Let us emphasize here that in our 
calculation we have kept only linear changes. The photon mass also 
keeps on changing as it falls in the gravitational field. We have not 
taken such effects into consideration since they only induce higher 
order corrections. 

We can also derive the red shift (6.71) or (6.73) purely from the 
principle of equivalence. According to the principle of equivalence 
the effect of gravitation can be completely rotated away locally by 
simply changing coordinate frames. Equivalently we can think of the 
observer at B as accelerating downwards with a constant acceleration 
g with respect to the observer at A (we note that the observer at A 
is attracted more strongly to the star than the observer at B). If 
at t = 0 both the observers are moving with the same velocity, then 
observer B would move away from A with time. Let observer A send 
a light signal at ¢t = 0 towards the observer B as shown in Fig. 6.3. 
If the signal reaches B at time t, then (c = 1) 
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‘ 
Fa 


Figure 6.3: Gravitational red shift from the principle of equivalence. 


t=h, (6.74) 


the radial separation between the two observers. Here again we 
are neglecting higher order terms like retardation due to motion of 
the observers. Clearly, in this time interval, observer B would have 
gained a velocity with respect to A which is given by 


|Av| = gt = gh. (6.75) 


Thus the light wave reaching B would be Doppler shifted and the 
wavelength would be given by (see (6.65)) 


Ap = Aa(14+|Avj), 
or, ae) = 1+4g9h. (6.76) 
Aa 


That is, the light wave would be red shifted. Furthermore, this is 
the same result as in (6.73) if we identify 


GnM 


2 


,  h=Ar. (6.77) 


It is useful to keep the limitations of this method in mind. Principle 
of equivalence applies only locally and hence this method is not ap- 
plicable when we are talking about widely separated points in space 
time. 
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We would next derive the gravitational red shift through yet 
another method, namely through the dilation of time. But to do 
that we have to discuss how measurements are done in a curved 
manifold or in a gravitational frame of reference. First of all we note 
that the invariant proper time interval in the presence of gravitation 
(in a curved manifold) is given by 


dr = (gu(x)da"dx”)? . (6.78) 


Unlike the Minkowski space, it is only the infinitesimal proper inter- 
val that makes any sense in a curved manifold. There is no global 
proper time. This can be seen from the fact that the time interval 
between two events A and B 


(At) az = i dr, (6.79) 


TA 


is path dependent. Therefore, although it is invariant under a co- 
ordinate change it is not very useful. (By the way, this kind of an 
approach gives a simple solution to the twin paradox which we will 
discuss shortly. Since the two persons have two different world lines, 
there is no reason why the associated time intervals would be equal.) 
In contrast, however, the coordinate system used by any observer 
gives a unique time interval between two events although it is not 
coordinate invariant, namely, 


TB 
(= i en oe (6.80) 
TA 


is unique but not invariant. The proper time of a clock is, of course, 
related to the coordinate time by the relation 


dr = (go9)2dt, (6.81) 


when the clock is at rest. 
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Let us now extend these ideas to the case of a star which sends 
out N light waves of frequency Vtar. If the interval between pulses 
as measured by an observer on earth is dt, then the proper time 
interval between pulses at the star is given by (i.e., the time interval 
measured by a clock on the star) 


1 
dTstar = (goo (Setar)? dt. (6.82) 


Similarly on earth, the proper time interval between pulses is 


1 
ATearth _ (goo(®earth))? dt. (6.83) 


Let us assume that the N waves are emitted in a proper time 
interval Azgtar. On earth N waves are received but in an interval 
ATearth: Thus, we have 


N = VetarATstar, N= Paw Teaath (6.84) 


Clearly then 


1 
Vstar = ATearth (goo (ei) 2 


Vearth ATsar (goo (star)? 


= dts 2Pearth 2 
1 + 2dstar 
= I+ (dearth = Datar) 
= 1+ Ad, (6.85) 


where we have used (6.68) as well as the fact that the gravitational 
potential ¢ is negative and inversely proportional to distance (see 
(6.5)). This again shows that the frequency received on earth would 
be lower or correspondingly the wavelength on earth 


Acarth 4 4 |Ag| > 1, (6.86) 


star 
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would be longer. Consequently we say that a light wave moving away 
from a gravitational source must be red shifted. 

The phenomenon of gravitational red shift has been experi- 
mentally verified. If we assume that light waves emitted by a star 
(Ms, Rg) are received on earth (Wz, Rp), then (6.85) gives 


“ = |Ad| = Gn (z= os ) ; (6.87) 
S 


The red shift of hydrogen spectrum of the white dwarf star 40 Eridani 
B is measured and the value 


A 
(>) =7% 105%, (6.88) 
A EXPT 
is obtained. Theoretically we can calculate the value to be 
AX 
(>) = 5.6 x 10-°, (6.89) 
A TH 


where Mg, Rg are used as inputs. This is in quite good agreement. 
However, astronomical tests of red shift are hard since there is a lot of 
error due to other cosmological effects. For example, the expansion of 
the universe causes a Doppler shift, the overall metric of the universe 
is not so simple as we have used and so on. 

Therefore, one looks for terrestrial measurements of gravita- 
tional red shift and since the shifts involved are so small we may think 
it is humanly impossible to verify them on earth. It is, however, pos- 
sible because of the sensitivity gained through the Mossbauer effect. 
Such a measurement involves the study of the y ray spectrum of a 
radioactive nucleus. In general it is hard to study the y ray spectrum 
of a nucleus since the emission of the 7 ray leads to a recoil of the 
nucleus which then Doppler shifts the spectrum. What Mossbauer 
had observed is that in certain crystals like Fe°’, the entire crystal 
rather than the nucleus alone, picks up the recoil and hence the pro- 
cess becomes virtually recoilless. In fact Fe°’ shows a line at 14.4 keV 
with a fractional half width 10~'%. If a radioactive emitter, say of 
Fe®’, is placed near a nonradioactive absorber of the same material, 
then, of course, it would lead to resonant absorption. However, if 
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the emitter is moved with a slight relative velocity, then the Doppler 
shift would change the frequency and hence the resonant absorption 
would be completely wiped out. 

To measure the gravitational red shift, an emitter was placed 
at the bottom of a 21.6 m (72 ft) tower and an absorber at the top. 
Due to gravitational red shift there was no absorption. However, by 
moving the emitter with a relative velocity to compensate for the 
gravitational red shift, resonance was restored. And this led to the 
experimental value of 


Av 


Vv 


= 2.56 x 10°). (6.90) 


EXPT 


On the other hand, theoretically we can calculate (see (6.85)) 


Av 1 1 
=| = [Aol = Gye (=~ =) 
Zi TH TL r2 
A 
= GyMp = = 2.46 x 107}, (6.91) 
EB 


Here we have made the approximation 


rirg = Rf, (6.92) 


and used Ar = rg — rT; = 21.6 m (72 ft). This shows extremely good 
agreement between the theoretical calculation and the experimental 
measurement. 


> Example (Stereographic projection of unit sphere). Let us consider a two di- 
mensional space described by the line element 
4 Ar? 
dr? = ——— dr? + ———— d¢ 6.93 
= Guepae 4 (rapa O28) 
where r € [0, co] and ¢ € [0,27] with ¢ = 0 and ¢ = 2zn identified. The metric 
tensor follows from (6.93) to have the diagonal form 


4 _ 4r? 
Grr = (1+ r2)2’ Iee — (1+ r2)2’ 
Pie 1 2\2 og 1 2\2 
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The proper length (time) between the points A and B along a path is given 
by 


TB TB 
T= far = J (aerar’ + gosdd?)?. (6.95) 
TA TA 


For a circle at r = Ro we can identify the two points A and B to correspond to 
ga = 0 and ¢p = 2n with r = Ro fixed. In this case, (6.95) leads to 


20 


r= | 46 Vae0(F) 


| 
ose ° 
Qu. 
Q 
i) 
xy 
Oo 


= = (6.96) 


Expression (6.96) gives the length of the circumference of the circle r = Ro and 
shows that 


lm +r 0, (6.97) 
Ro—0o 


so that the region described by Ro — oo actually corresponds to a point. 
As we have seen, the action (6.50) (a dot denotes a derivative with respect 
to T) 


1 F : 
s= far 5 (Gert? + 9999") 


leads to the geodesic equations through the Euler-Lagrange equations. The Euler- 
Lagrange equations (see (6.55)) for r and ¢, respectively give 


. Wr i. r(l—r?), 
fr — 
1+r? 14+ r? 
d Ar? 
ae (are) =" = 
We conclude from the second equation in (6.99) that 


4rd 


(4122 = const. (6.100) 
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The region r — oo, as we have seen in (6.97), is mapped to a point and 
all other points seem to be regular. This is, in fact, nothing other than the unit 
sphere. To see this we note that if we define 


r= tan 7 (6.101) 
with @ € [0,7] then, 
29 


0 at 
dr=d (tan 5) = 55 5 dé. (6.102) 


It follows now from (6.93), (6.101) and (6.102) that 


dr = dr? 2 
t= pee eae 
Atan? 2 
= —, sectS ae? 4 AE ag? 
sect $ 2 sect 
= dé” + sin? 6d¢?. (6.103) 


This is nothing other than the line element on a unit sphere. We can view the 
original coordinate as representing the stereographic projection of the unit sphere 
with the azimuth angle ¢. 

< 


6.4 Twin paradox and general covariance 


In special relativity we know of the following paradox involving a pair 
of identical twins. If one of the twins stays on the earth while the 
other travels in a space craft then for the twin on earth, the partner 
is in a moving frame and hence his biological clock would slow down. 
Therefore, when he comes back to earth, he should be much younger 
than his twin partner on earth. On the other hand, in the rest frame 
of the twin on the space craft, the partner on earth is moving and 
hence his biological clock should slow down. Consequently, upon 
return the twin travelling in the space craft should find the earth 
partner to be much younger. 

This, therefore, poses a problem and is referred to as the twin 
paradox. We can see now how the resolution of this puzzle (paradox) 
comes simply from considerations of general coordinate transforma- 
tion. To simplify the problem let us assume that there exists only 
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one space and one time dimension. Let us further assume that in the 
rest frame of A, the position of B is given by 
tp = f(t). (6.104) 


Then in the rest frame of B, the position of A is given by 


X=@-f,=2- f(t). (6.105) 
Let us further assume that both observers use the same coordinate 
time, i.e., 

t=T. (6.106) 
Then the invariant length interval (c = 1) in the frame of A has the 
general form 

dr? = gua dx? + 2gudardt + gydt?, (6.107) 


whereas in the frame of B we can write 


dr? = gxxdX? + 2gxrd XT + grrdT’. (6.108) 


From the transformation relations (6.105) and (6.106) between 
the two coordinate frames we note that 


dr? = g¢¢dX* 4 26570 XdT + Ge dT” 

= gee (daft) + Voy a(de— fdt\di ond 
gxxdx? + (29xr — 29xxf)dadt + (grr — 2gxrf + gxxf?)dt? 
= Gprdx? + 2gn4dadt + gydt?. (6.109) 


This leads to the relations between the metric components in the two 
coordinate frames as 
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Gear = GX X; 
Jet = QGxr- Geet, 
Kt = Grr- ose al Geet (6.110) 


which can also be obtained from the transformation of the covariant 
metric tensor (3.17) under the coordinate transformations (6.105) 
and (6.106). Conversely, we can invert the metric components in 
(6.110) and write 


9xx = Jax) 
9xr = Got+ Gua, 
grr = gn t2(Gert+ Graf) f — Goaf? 
= gn+2etf + Gecf?. (6.111) 


Let us now assume that the take off time of the space craft is 
labeled by t; = T) and that the time of return by to = 75. Thus the 
age of B in the rest frame of A is given by 


14(B) = far 


ty 


to ao 
dxp dxp ate 

— 29x Ee cx 
fo (ae Get | + 9 (=) 


t1 


tg 
. . ab 
= fe (gt + 2Getf + Onit)? 
ty 


T2 
=> fer V OTT: (6.112) 
Ti 


On the other hand, the age of B in his own (rest) frame is given by 
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T2 
ra(B) = f ar VGrr> (6.113) 
Ti 


which shows that 


74(B) = T,(B). (6.114) 


In other words, the age of a person is independent of the coordinate 
system. It is also clear that the age difference of the two persons is 
unique and is given by 


t2 


Ta(B) — T4(A) = pu (/Grr — 9%) =Te(B)—T(A), (6.115) 


ti 


so that there is really no paradox involving (the frames of) the twins. 


6.5 Other equations in the presence of gravitation 
We have already seen that the free particle equation in the presence 


of gravitation is given by 


d2xt _ ‘i dx” da* 
ue dr? aa dr dr 


= 0. (6.116) 


In addition, if the particle experiences an external (nongravitational) 
force f¥, then the equation of motion would modify to 


dat ii dx” da* 


_ — — fl Ll 
we dr? ae) dr dr I (6.117) 


where f" represents the external force four vector. As in the case of 
flat space-time (see (2.9)), the force four vector (acceleration) has to 
be orthogonal to the proper velocity four vector, namely, 


Ups” = guwu" fY = 0. (6.118) 
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There are various ways of seeing this, but probably the simplest 
is to note that the equation (6.117) can be written in terms of the 
proper velocity as 


Du" 
re Dr 


which leads to 


= ft, (6.119) 


D 
m D(guvubuY) 5 
or, 5) D7 = Gu ie 
m (guy uhu ) Lb fv 
— — — 2 a: 
or, 5 ae O=g.0u" (6.120) 


Here, in the intermediate steps, we have used the metric compatibil- 
ity condition (5.8) as well as the normalization of the proper velocity 
(2.8) in the curved manifold (5.124), namely, 


Juhu’ = 1. (6.121) 


The electromagnetic theory or Maxwell’s theory (with sources) 
can also be extended to curved space in the following way. We know 
that in flat space, Maxwell’s equations are given by 


Oye a ian On Pos = 0. (6.122) 


In the presence of gravitation the first equation can be written in the 
covariant form 


Depe? = (6.123) 


where (the connections do not vanish for the contravariant field 
strength tensor unlike in the case of the covariant tensor in (4.189)) 


Fey — DH AY — DY Al = — YH, (6.124) 


6.5 OTHER EQUATIONS IN THE PRESENCE OF GRAVITATION 195 


Furthermore, we know that (see (4.162)) we can write 


D, FH = = a,(V=9 FH”), (6.125) 


so that the first equation of Maxwell becomes 


D, FY = J", 
1 
— 0,(J/-g F"”’) = J, 
J/-9  ( g ) 
or, On(/—g FH’) =V/-g J”. (6.126) 


The second equation (the Bianchi identity) takes the covariant form 
(see (4.190) ) 


or, 


Dy Fux = OF Lx = 0. (6.127) 


Thus, Maxwell’s equations (with sources) in a curved manifold 
(or in the presence of a gravitational field) can be written as 


Vv—-g FM) =V—g J", Oy Fy) = 0. (6.128) 


Furthermore, from the first equation we see that 


OO, /=G FH”) = O(V=G J”) =0, (6.129) 


which follows from the anti-symmetry of the field strength tensor. 
Therefore, we see that, in the presence of gravitation, current con- 
servation takes the covariant form (see (4.150) and compare with 
(2:75) ) 


Oul/—g JY) = /=G DJ" = 0. (6.130) 


It is now obvious that we can define a global charge associated with 
the system which is conserved. Namely, if we define 
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‘j= [ee J/—g J, (6.131) 


then, it follows that 


oe = f ate ov do(./—g J°) = -[@ t 0;(./—g J*)=0, (6.132) 


where we have used (6.129) (or (6.130)) and have also assumed that 
the current vanishes asymptotically. 

Let us now examine the stress tensor of a particle. We have 
already seen that if a particle is subjected to an external force field 


then we have to define a total energy momentum (stress) tensor as 
(see (2.128)) 


Thetal = = tte a se (6.133) 


and it is the total stress tensor which is conserved in flat space, 
namely, 


OnT ictal = 0. (6.134) 
In the presence of gravitation this equation should be covariantized 
which leads to 


Dickes 


total 


= 0, 


Ot, Og eS = ere 0, 


total Lup~ total Lup~ total 


1 
or, OT; iam + Vo Oo(/-9)T etal = = 1 aye oe aih 
OF; On (VW Vv G1) 3) = ae (6.135) 


% 


Thus we see that unlike the charge current, the total stress 
tensor for the matter and the external fields does not satisfy a con- 
servation law. Consequently, we cannot define a global momentum 
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from ./—g T, a which would be a constant in time. In other words, 
if we define 


0, 
Protal = / d®x (V=9 Tea) (6.136) 


then it follows that 


dP etal 
orca # 0. (6.137) 
This difference from flat space is understandable since the total mo- 
mentum that would be conserved has to include the momentum as- 
sociated with the gravitational field as well. However, we do not 
yet know how to construct the energy momentum associated with 
the gravitational field. This is because so far we only know how to 
extend quantities from flat space to a curved manifold. However, 
in a locally flat Cartesian coordinate frame the gravitational energy 
would vanish since it would be constructed out of derivatives of the 
metric tensor which is constant in the local frame. This, therefore, 
leads to difficulties in defining a global conserved momentum for a 
gravitational system. However, let us note here that for an isolated 
(closed) system it is always possible to define a global momentum 
which is conserved. We would come back to this question later. 
As another example of a dynamical equation, let us consider the 
relativistic Klein-Gordon equation for a scalar field in flat space-time 


(0 +m?) 6(z) = (0,0" +m”) d(x) = 0. (6.138) 


In the presence of gravitation this generalizes to 


(D,,D" +m?) o(x) =0, 
or, D,0"b(x) +m? d(x) =0, 


Fs 8 (V=G I" H(e)) + mola) =0, 


or, On. (/—g 0" d(z)) + V—G md(z) = 0. (6.139) 


or, 
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This discussion shows how any equation in flat space-time can be 
generalized to a curved manifold. (Generalization of equations in- 
volving fermion fields needs the concept of tetrads or vierbeins that 
we will not go into.) 


CHAPTER 7 


Curvature tensor and Einstein’s equation 


7.1 Curvilinear coordinates versus gravitational field 


A natural question we can ask at this point is how we can distinguish 
between a curvilinear coordinate system (in a flat space) and a grav- 
itational field. Quantitatively we may ask if we make a coordinate 
change such that 


Of Og? 
"ww = De Oar Ip), (7.1) 


and g,,(x) has a nontrivial form, how can we distinguish this metric 
from a genuine gravitational metric. The answer lies in the fact that 
in such a case, given g,,,(a) we can always find a coordinate frame in 
which the metric becomes the flat Cartesian metric. Correspondingly 
in that frame the curvature tensor will vanish identically. Namely, 
in that frame 


Bes ts), (7.2) 


and since this is a tensor equation it must be true in any coordinate 
frame even if the metric has a nontrivial form. Consequently the 
manifold must be flat. On the other hand, the Riemann-Christoffel 
curvature tensor will not vanish in the presence of a genuine gravi- 
tational field. 


7.2 Definition of an inertial coordinate frame 


We have seen that in any coordinate frame the metric must satisfy 
the metric compatibility condition (4.103) 
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Dugur = OuGur 1 Div Ipr + Diy 9up = 0. (7.3) 


Furthermore, we have seen (see the theorem following (4.31)) that 
given a symmetric connection, we can always make a coordinate 
change to bring the connection to zero at a particular point. Thus we 
see that in addition to making the metric flat at some point we can 
also make the connection vanish at that point through a coordinate 
transformation. Mathematically this is equivalent to saying that at 
XL = Lp we can have 


Juv (Lp) = Nuvs OpGuv |p—ap = 0. (7.4) 


A frame where (7.4) holds is known as an inertial frame. However, we 
note that this does not say anything about the higher order deriva- 
tives of g,, at 2 = Xp. In particular, 


Or Opis paoer 0 (7.5) 


if the manifold is curved. On the other hand, if we go back to the 
definition of the curvature tensor, we note that 


Vy. 


Re" do ~ Ag Organ es (7.6) 


so that with a coordinate change we cannot rotate away the curva- 
ture. However, over infinitesimal distances the effect of curvature 
will be negligible. 


7.3 Geodesic deviation 


We have already seen that a free particle follows the geodesic path. 
Let us now consider a region of space containing a number of geodesics. 
Let us assume that the family of geodesics is parameterized by the 
proper time 7 as well as by a second parameter fh which distinguishes 
between the different geodesics. The geodesics, of course, obey the 
equation (see (5.83)) 


07x" (7, h) Ox” Ox* 
ee ae ee 
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If we define the tangent vector as 


u(r, h) = (78) 


then the geodesic equation can also be written as (see (5.86)) 


Duh = Ou 
he = Oe = Pear =0. (7.9) 


Let us now define a second vector associated with the trajecto- 


ries as 


be 
vir) = (7.10) 


This describes the difference in the coordinates of two infinitesimally 
close geodesics (for the same value of 7). Clearly from the commu- 
tativity of the ordinary derivatives we have 


He 2 abt u 
Ov Ox Ou (7.11) 


‘Or OrOh Oh’ 


Therefore, we can write 


Dut Ou ORY r 
Drs Or eae) 
L 
= an —T4, ue, (7.12) 


which leads to 
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D2vl =, 0 Dor i Ox? Dv? 


Dr? dr Dr"? dr Dr 


po Oh os 
O Out Ou” 
= aac Be o,v,Xr Th ee Xr 
a) OsT uuu vA Br v 
fa) Xr Ou? 
—Ti,u” S— — Thu? + Te Te uu’. (7.18) 


Using the geodesic equation (7.9) we can write 


D?v# 2) 

7 = oh (Te ur) oar OgT Hur u’ ur 

Our 
o\ AX 

-l) (Ty uu Ju = A ae Oh 
+TM Te uu’ 


O,T Hv? uur — OoT hur u’ ur 


d .. 
STE pert O° ae Re rare 


= (M4, —9,0%, —Te, 1s, + P40) ueu’® 


2 

or, — = Ree, (7.14) 
Thus we see that the second derivative of v“ is proportional to the 
Riemann curvature tensor. Therefore, the separation between the 

geodesics changes (as 7 changes) if the curvature does not vanish. 
This shows that in the presence of a gravitational field a group 
of geodesics do not remain parallel. This is known as the geodesic 
deviation and we immediately see that by studying the deviation of 
the geodesics we can detect the presence of a gravitational field. This, 
of course, does not contradict the principle of equivalence because 
if we restrict ourselves to an infinitesimal volume in the manifold, 
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then the term on the right hand side of (7.14) will be negligible and 
hence we cannot detect the deviation in the geodesics in such an 
infinitesimal region (locally). 


7.4 Properties of the curvature tensor 


The Riemann curvature tensor is a rank four tensor and hence has 
4* = 256 components. However, not all of them are independent or 
nontrivial. This is because the curvature tensor has many symmetry 
properties. For example, from the defining relation (see (5.17) ) 


[Du D,\AP = Seen (7.15) 
we note that 


Res He 


op ov" 


(7.16) 


That is, the curvature tensor is anti-symmetric in the last two indices. 
Furthermore, let us consider the scalar length of a vector €“ defined 
by 


o= Eue® = GEE". ae) 


This leads to 


D\Dpe = Dy (29yv (DpEé") £") 
29uv ((Dx DoE") &” + (Dp&") (Dy€")) , (7.18) 


where we have used the metric compatibility condition (7.3). 
It now follows from (7.18) that 


[D,, Dp|¢ = 290 (DX, DzlE” 
= 29 ue Revo 
26" E° Rucdp- (7.19) 
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On the other hand, we know that since ¢ is a scalar function (see 
(4.179)) 


[Dx, Dold = (0a, Op] = 9, (7.20) 


so that, comparing with (7.19), we have 


Dee: Boeke = 0. (7.21) 


Since this is true for any arbitrary vector €" in an arbitrary curved 
manifold, this implies 


Ruerp oa. —Repvrp, (7.22) 


and we conclude that the Riemann-Christoffel curvature tensor is 
anti-symmetric in the first two indices as well. 

Furthermore, let us take an arbitrary vector €, and construct 
from it the completely anti-symmetric third rank tensor 


Etuv:d) = (Suvir — Eu:sw) + cyclic permutation of uv 


[D,, Dv], + cyclic permutation of pvr 
= —R*,)/$o + cyclic permutation of pv, (7.23) 


where we have used (5.19). On the other hand, we also know that 
(see (4.181)) 


ire = 0. (7.24) 


Therefore, since €,, is an arbitrary vector, it follows from (7.23) that 
(it can be checked easily that the cyclic permutation of pA is the 
same as the cyclic permutation of Av) 


R/$ + cyclic permutation of pAv = 0, 


or, R*.y, + cyclic permutation of pAv = 0, 


or, Roprw + cyclic permutation of wAv = 0, (7.25) 
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which can be written explicitly as 


Ror + Rov + Rovpr = 0. (7.26) 


Alternatively, using the anti-symmetry property in the first two in- 
dices (7.22), we can also write this relation as 


Ryo a Ryoup =F Ryopr = 0. (7.27) 


With these relations, let us now look at the combination of the 
curvature tensors 


Ruwrp - Rp. (7.28) 


Using the cyclic identities (7.26) and (7.27), we determine 


Rie — Ripe = = Ege + Rion) +S Ry Bop) 
= —Ryrpv + Ropru 
os (Rorvp zs Ry up) a (Rorup + Rippr) 
— (Ryvrp — Rap) = 9. (7.29) 


Since this is a tensor equation it follows that 


Ruvrp — Rppv = 9, 
or, Rywrp = Rypuv- (7.30) 


Parenthetically we note that this relation can also be derived simply 
from 


Rwrp = —Ryrpv — Ruprr 
= —Ryrpy + (Ropru + Rap); 
Rwrp = —Rrvpp — Rovpr 


= (Rrpuv a Ryyvp) = Rovyr- (7.31) 
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This leads to 


Rudpv _ Rovur = 0, (7.32) 


which coincides with (7.30). 
We can now collect all the algebraic properties of the curvature 
tensor as (see (7.16), (7.22), (7.26) and (7.30)) 


Rywrp = —Ruvor = — Rupr = Apu, 


Rywdp + Rydrpv IF Rupvr = 0. (7.33) 


This reduces the number of independent components of the curvature 
tensor to 20 (36 — 12 — 4 = 20). This is still a large number of 
components since we know that they have to be constructed out of 
ten components of the metric tensor. In fact, these 20 components 
depend on the choice of the coordinate system used. 

Let us recall that the curvature tensor is defined as (see (5.18)) 


TR xcs a Oly _ ON. a ie Dee ae ela ee, (7.34) 


with the connection given by the Christoffel symbol (4.49) 


1 
CAS = 5 IF" (A920 + OGov — Aa Gur): (7.35) 


In a locally flat Cartesian coordinate system we have (see (7.4)) 


Guy = Muvs Oo Iv = 0. (7.36) 


Therefore, in this coordinate system, we can write (the connection 
vanishes in this coordinate system) 
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1 
RN 9 ae (OpO.9Xo + OO Gov = Opa Gur 


= OO, Gpo = O\05Gov + O)959up)s 


1 
or, Ruvrp = ~5 (QnA Oru = OpOn Gur + OvOnGup = OVO Gon) 


_ -5 (Op Ory = OpOn rv + OVO 9up ax OO, Gpp)- 
(7.37) 
Written in this form the symmetry properties of the curvature tensor 
(7.33) are manifest and they continue to hold in any coordinate frame 
since the symmetry properties are respected by a general coordinate 
transformation. 

Let us also note that we can obtain other symmetry relations 
of the curvature tensor which are not algebraic in nature. These are 
known as the Bianchi identities. (See (4.190) for the Bianchi identity 
in the electromagnetic theory.) To derive these, let us consider a 
general vector €,. From (5.19) we know that 


[Dy Dol&u = Euslosa] = Eee s (7.38) 


Let us further differentiate this covariantly which leads to 


Giittoakher = Fiwioiee” a Rywrpb sa (7.39) 


On the other hand, we note that (see (5.21) and note that the metric 
tensor is covariantly flat) 


Eusos[o,A] = [D), Dol&uic 
Ryvrp ae ak Revrp ae (7.40) 


If we anti-symmetrize both sides of (7.39) with respect to pAo we 
obtain 


Eu:[p:dio] = Suslp;aj;o + cyclic permutation of pA 


= [Ripe a Ringers + cyclic permutation of po, 
(7.41) 
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while the anti-symmetrization of (7.40) leads to 


Eu:[o:A;0] = Suso;[p;A] + cyclic permutation of pA 
— Bite ecen + Rie. + cyclic permutation of pAc. 
(7.42) 


The second term on the right hand side of (7.42) vanishes because 
of the cyclic identity (7.27). As a result, from (7.41) and (7.42) we 
obtain 


Euslo:dso] = RuvrpjoS” + RuvrpS”.6 + cyclic permutation of pro 
Rywrp$'.g + cyclic permutation of pAc, (7.43) 


which yields 


Rwvrp:06" + cyclic permutation of pA = 0. (7.44) 


Since this must be true for any arbitrary vector €”, we conclude that 


Ryvrp;0 + cyclic permutation of pAc = 0. (7.45) 


This is known as the Bianchi identity (in gravitation) and is quite 
useful in the study of gravitation as we will see. 

Let us note here that we can contract various tensor indices of 
the Riemann-Christoffel curvature tensor to obtain other curvature 
tensors of lower rank. However, because of the anti-symmetry of the 
curvature tensor in various indices, the number of contractions we 
can make is limited. Let us define 


Rurx = GF? Rwrp = BR = R, (7.46) 


v 
pv pb vA" 


It is clear from the definition in (7.46) that 


Run = Titan =. Sty = rane (7.47) 
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where we have used the symmetry properties in (7.33). The second 
rank tensor R,, is known as the contracted Riemann curvature or 
the Ricci tensor. It is symmetric in the two indices as can be seen 
from 


Ry = GF? Ryvup = GF? Rupr = Ry, (7.48) 


where we have used (7.30). Therefore, the Ricci tensor is a second 
rank symmetric tensor with only ten independent components (ex- 
actly like the metric tensor which is also a second rank symmetric 
tensor). Furthermore, if we contract the indices of the Ricci tensor, 
we obtain the Ricci scalar curvature, namely 


R= gt Rix: (7.49) 


Note that this is the only nontrivial scalar curvature that we 
can construct from the Riemann tensor. The only other possibility 
which can be constructed using the Levi-Civita tensor vanishes, 


OE =O, (7.50) 


because of the cyclicity identity (7.26) satisfied by the Riemann 
tensor. 

Let us now examine the consequences of the Bianchi identity 
(7.45) for the Ricci tensor. To see this, we recall that 


Ryvrp;o0 + cyclic permutation of pAc = 0. (7.51) 


Since the metric is flat with respect to covariant differentiation (4.103), 
we now obtain 


7” (Ruvrp;0 + cyclic permutation of pAc) = 0, 
or, oO (Ravine + Rywvpo;r an Tisiakte) ae 0, 


or, Ryrio — Ryuoix — RP = 0. (7.52) 


poA;p 
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Contracting further the indices yw and X this leads to 


gt (Rao _ Ryo:d = is) = 0, 


Ory. < Hage py Rs = 0, 


1 
or, RY. —-= Rv =0, 


VY 1 Vv 
Oty. Pp 5 RY =0. (7.53) 


If we define a second rank symmetric tensor from the Ricci tensor 
(7.46) and the Ricci scalar (7.49) as 


1 
Gt = RY gl R, (7.54) 


then, it is clear from (7.53) that we can write 


? 


1 
DyGt? = RM, — 5 RY =0, (7.55) 


which shows that this tensor is conserved in a curved manifold. G*” 
is known as the Einstein curvature tensor and with these preparations 
we are now ready to discuss Einstein’s equations. 


> Example (Symmetry in inertial frame). We recall from (7.37) that in an iner- 
tial frame defined by (7.4) 


Guv = Nuv, Oo Guv = 0, (7.56) 


so that in this frame we have 


i, (7.57) 


we can write 


1 
Ryvrp = 75 (OpOvgru — ApOAngrv + O:Ougup — OrOvGup)- (7.58) 
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Since the metric tensor is symmetric and the derivatives commute, namely, 


Juv = Guus OwOv = OvOp, (7.59) 


it follows from the definition in (7.58) that (see (7.33)) 


Ryvrp = Ruprp = Ryvpr — Ryppv- (7.60) 


Similarly, from (7.58) we also obtain 


Ryvrp + Rurpv + Rupvr 
1 
= ~95 (2,0.9.1 — OpOngrv + OOnGup — OrOv Gup 


+ OvOGpu — OvOnGor + OpOpgav — OpOGuv 


V8 ts DOG Oso vApgur) 


=0. (7.61) 


Finally, we note that since in the inertial frame the connection vanishes (see 
(7.57)) we have 


Ruyuvrpic im Oo Ruvrp, (7.62) 


which leads to 


Ruoree + Ruvpo:r + Ruvorp 


= —5 (260,094 — Apugrv + Or9h.4-» ~ Ox2vIup) 
+O (Oc Ov 9 pu — Oo On Gov + ApOnGue — OpAvGuc ) 
+ 9p(O.9.Gon — OOnGov + AxOnGur ~ Oe 9.94r)) 
=0, (7.63) 


which coincides with (7.45). Thus, the symmetry properties of the curvature 
tensor are easily checked in a locally inertial frame and being tensor symmetries, 
they hold in any frame. 

< 
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> Example (Curvature in two dimensional space). Let us consider the two di- 
mensional Euclidean space. In two dimensions R,,,, must have the symmetries 
already mentioned in (7.60) and (7.61) which implies (in two dimensions) 


Riis = Riiz2 = Riiiz = Ree = Re212 = Re222 = 0, (7.64) 


by antisymmetry (for example, Riii2 = —Riii2 = 0). Therefore, the only non- 
trivial component of the curvature in two dimensions, from symmetry considera- 
tions, follows to be 


Rizi2 = Raia = —Raii2 = —Riza1. (7.65) 


This is in agreement with the fact that there is only one linearly independent 
term in Ryvr, in two dimensions. We can relate this curvature to the Gaussian 
curvature discussed in chapter 1 (see (1.19)) in the following way. 

From the definition of the Christoffel symbol (4.49) we can calculate and 
obtain 


1 1 
[iy = g Agu 5 g° (201912 — 02911), 


1 1 
Tis = Th = “3 g' dogit ial 59 Oig22, 


1 1 
hes 5 g'! (202912 — O1922) — 5 g'? 02922, 


1 1 
ri, = 5 g' Ogu —35 g”° (201912 — 02911), 


1 1 
Tis = ion = ~3 g Oogir a 3 9 A192, 


1 1 
Too = 5 gg” (202921 — O1g22) — 3 Gg? 82g22. (7.66) 


The Riemann-Christoffel curvature tensor can be calculated from the definition 
in (5.18) as well as 


Ryuvrp ca GuoR rnp: (7.67) 


However, we note that in two dimensions, it is much easier to calculate 


gRyvro, (7.68) 


where g denotes the determinant of the metric tensor. 
In two dimensions, the determinant of the metric tensor has the simple 
form 


9 = 911922 — 912921; (7.69) 
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which leads to 


97 =922, 99" =9u, 99" =99"' =—Gi2. (7.70) 


In deriving (7.70) we have used the two dimensional identities (g"*g,, = 64) 


11 12 

g git+g gn =1, 
22 21 

g g2+g9 g2=1, 


11 12 
g gi2+ 9 g22 =0, 


gon + 921 =0. (7.71) 
With this, we can now calculate (7.68) which leads to 
gRi2i2 = Z (20102912 — OF gir — Oi g22) 
= e [A191 (202912 — O1g22) — (d2911)"] 
gi2 
+ ie [0191102922 — 20291101922 + (201912 — 02911) (202912 — O1922)] 
- a [02922 (201.912 — 02g11) — (01.922)"] 


= 9K, (7.72) 


where K is the Gaussian curvature defined in (1.19). Therefore, we conclude 
that, in two dimensions, 


1 
Kk == Rio, (7.73) 
g 
is the unique curvature tensor. Alternatively, we can write 


Rioiz = gk, (7.74) 


and keeping in mind (7.69) we recognize that in two dimensions (7.74) allows us 
to write the Riemann-Christoffel tensor as 


Ryvrp = K (gurgup — GupGrr)s (7.75) 


which has all the symmetries (7.64) and (7.65) manifest. Furthermore, from 
(7.75), the Ricci tensor follows to be (in two dimensions) 


Ruy = 9°? Ruove = K Gy, (7.76) 
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leading to the Ricci scalar 


R=g"” Ry =2K. (7.77) 


7.5 Einstein’s equation 


In the last chapter we extended equations for various particles and 
fields to a curved manifold (in the presence of gravitation). This 
involved simply taking the relativistic dynamical equations in a flat 
manifold and covariantizing them. A natural question that comes 
to mind is, of course, what is the equation for the gravitational po- 
tential itself. This is particularly interesting because we know the 
equation for the gravitational potential only in the non-relativistic 
limit. Classically we know that the static gravitational potential sat- 
isfies the Poisson equation. Namely, if ¢(x) denotes the gravitational 
potential, then it satisfies the Laplace equation 


V70(x) = 0, (7.78) 


in regions free of gravitational sources (masses). However, if there is 
a distribution of (non-relativistic) matter in a given region, then the 
gravitational potential satisfies the Poisson equation 


V°o(x) = 4nGyp(x), (7.79) 


where Gy is Newton’s constant and p(x) describes the matter density 
in the given region. In CGS units, Newton’s constant has the value 


Gy & 6.67 x 10-8cm?g~tsec?. (7.80) 


Equation (7.79) can be seen easily to hold by noting that the gravita- 
tional potential for a continuous distribution of matter can be written 
as 


eres / 3x! a (7.81) 
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so that 


V*o(x) 


1 
_ 3! / 2 
Gy f ax p(x’) V (—) 


—Gy / d?x’ p(x’) ( — 4163 (x — x’) 


= 4nGyp(x). (7.82) 


We have also seen in (6.21) that in the weak field limit, we can 
identify 


goo(x) = 1+ 26(x), Gij = Nhj- (7.83) 


Thus if we assume that the metric tensor g,,, represents the gravita- 
tional potential, then the equations for g,,,, must involve second order 
derivatives of the metric and must reduce to the respective equations 
(7.78) and (7.79) in the appropriate limit for the two cases (matter 
free and in the presence of a distribution of matter). Furthermore, 
we are looking for at least ten equations since g,, has ten compo- 
nents. These must be tensor equations since g,,, is a tensor and the 
equations are expected to be covariant under a general coordinate 
transformation. 

Let us now look at the geodesic deviation equation which we 
have derived in (7.14) 


D2! , 
De = Re pnt ulrr, (7.84) 
where 
Ox Ox 
fn BE fe 
U ae Dh (7.85) 


Infinitesimally, therefore, we can write down the geodesic deviation 
as satisfying the equation 


D2 6a! aw ae dx” dx? 


vA _ Xr 
pee — vor Ga ade 6x = 105 6x ; (7.86) 
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where we have identified 


da” da? 

HH _ pH 
Let us now consider the equivalent problem in classical Newto- 
nian physics. Let us assume that a particle of unit mass is subjected 
to a static, space dependent gravitational force. In this case, from 


Newton’s equation (see (6.6)) we have 


d?x' 
dt? 


=- d¢,- (7.88) 


Similarly, a nearby geodesic (trajectory) satisfies 


ar =~ O4ls- (7.89) 


If these trajectories are infinitesimally apart then we can write 


éx° = #' — a", (7.90) 


and 


d'd(x)|. = d'6(x)|, + 5x) 0'O;0(x)|, + O(dx7), (7.91) 


x 


which leads to 


d2z = d?a* 
dt? dt? 
d?dx* 
dt? 


= — 0'9(x)|,, + O'9(x)|,, 


or, (7.92) 


~ — 62) 8'0;6(x)| 


oe 


Comparing this with the geodesic deviation equation (7.86) in the 
curved space we can identify 


Ki, = —0'0;¢. (7.93) 
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We know that in empty space we can write (see (7.78)) 


V°o =-6'0;¢ = K*, = 0. (7.94) 


Therefore, we feel comfortable in generalizing this equation to curved 
space, free of matter, as 


dx” dx? 


H _ Pe oe ae 
= Roy je ae 0, 
dx” dx? 
or, —Rup A ae = 0. (7.95) 


. : : be 
Since this must be true for any arbitrary vector a we have 


Rip =0, (7.96) 


in a region free of matter (gravitational source). 
In other words in empty space we can write the gravitational 
field equations as 


Ry =0. (7.97) 


This is a tensor equation and since R,,, has ten components, there are 
ten equations. Furthermore, the curvature involves second deriva- 
tives of the metric tensor and hence it is plausible that this is the 
correct equation in the case of empty space. We also note here that 
the more stringent equation 


RY =0, (7.98) 


involves more than ten equations and leads to the trivial flat space- 
time solution for the metric and, therefore, has to be rejected. 

To derive the equation in the presence of matter, let us note that 
the classical non-relativistic (static) equation (7.79) can be written 
in the weak field limit in the form 
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V2 = 4nGyp, 
or, V°goo = 87GyToo. (7.99) 


Therefore, we recognize that generalization to curved space-time 
would involve the stress tensor on the right hand side of the equation 
(as source for the gravitational field). As we have seen in (2.113), 
T*” is a symmetric second rank tensor and hence it is tempting to 
generalize (7.97) to the case of non-empty space as 


RY = aT”, (7.100) 


where T“” is the stress tensor (associated with matter including ra- 
diation and other non-gravitational fields) and a is a normalization 
constant to be determined. This is a genuine tensor equation which 
reduces to the free space equation (7.97) in the source free case 
when 


TH = 0, (7.101) 
However, this equation suffers from the defect that it is not a con- 


sistent equation. That is, whereas the stress tensor is conserved (see 
(6.135) ) 


D,T"” =0, (7.102) 


the Ricci tensor is not in general 


DRY £0. (7.103) 


Historically, equation (7.100) is what Einstein had suggested 
originally to describe the dynamics of the gravitational field. How- 
ever, very quickly he as well as Poincaré, independently, realized the 
inconsistency in the equation and generalized the equation to 


Gl = aT”, (7.104) 
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where G“” is the Einstein tensor defined in (7.54). As we have seen 
in (7.55) 
1 
D,G” = D, (Re -5 yk) =0, (7.105) 


so that the equation (7.104) is at least consistent. Furthermore, we 
note that in empty space (7.104) reduces to 


Guy = Rey — 5 g'¥R=0. (7.106) 


This seems to be different from (7.97), but we note from (7.106) that 


4 
IwG = R-> R=0, 
of; R=: (7.107) 


Hence using this in (7.106), we obtain 


Guy = RM = 0, (7.108) 


in the case of empty space. Therefore, in the source free limit, (7.104) 
indeed reduces to the free space equation (7.97). 

The constant a in (7.104) is a normalization constant to be 
determined shortly. We note that the field equations also lead to the 
relation that 


1 
Iu Gl = Juv (Re a #"R) = egy l , 


4 
or, R- 3 R=al, 
or, R=-alT, (7.109) 


so that we can write the Einstein equation (7.104) also as 


RH =a 6 = 5a” r) (7.110) 
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Here T = T‘, denotes the trace of the stress tensor. Furthermore, 
the stress tensor is that associated with matter particles and fields 
(not including gravitation). 

Let us now see whether the Einstein equation (7.104) (or (7.110)) 
reduces to Poisson equation (7.79) and if so what is the value of the 
normalization constant a for which this would be true. To simplify 
the derivation, we assume that matter consists of static dust parti- 
cles (without any radiation field) for which we can write the stress 
tensor as 


__, aot da" 
ae ae 


Te = pore: (7.111) 


Since the dust particles are static (t = 7), then it follows that 


T™ = p, 
Pea 0. 
T = Gy,T = gooT” = goop = P. (7.112) 


Here we have assumed that the metric changes only slightly from the 
Minkowski metric and correspondingly have kept only the dominant 
term in the trace in the last relation in (7.112). 

Assuming that the gravitational potential is weak, we can ap- 
proximate the Riemann curvature (5.18) as 


Bg ORL OE G s (7.113) 


so that the Ricci tensor (7.46) takes the form 


Ripa OT =O.04,. (7.114) 


Furthermore, since only Too is nonzero (see (7.112)), it is sufficient 
to look at the Rog equation in (7.110) to determine the constant a. 
From (7.114) we have 


Roo = O01, — uV b0- (7.115) 
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However, since we are assuming the gravitational potential to be 
static, the metric tensor would not involve time and we can write 


Roo = —d,0 4b = —O:T bo 
= —0i | —5 9°" (20905 + A095 — Oj 900) 
= Sy 0:0" goo = 5 V~900- (7.116) 
Furthermore, we recall that for a weak field, we can write 


goo = 1+ 26, (7.117) 


where ¢ is the gravitational potential so that the field equation in 
(7.110) leads to 


1 
R® ~ Rog = a (7 5 sr) ; 
or : V7(2¢) =al p- = 
, 9 7 p a 9 P; 
or, Wdo= 5 p. (7.118) 
Comparing this with the Poisson equation (7.79) 


V7¢ = 40Gyp, (7.119) 


we determine 


a= 81Gy, (7.120) 


so that the field equations (7.104) and (7.110) in the presence of 
matter can be written as (Gy is Newton’s constant) 
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Gy 


1 
(Re Bat, g""R) = 89GyT"”, 


1 
or, RM” = 8nGy (7 a5 v7) ; (7.121) 


Both forms of these equations are known as Einstein’s equations and 
it is clear that they relate the distribution of matter and energy to 
the curvature associated with the space-time manifold. 

Let us note here that different authors use different conventions 
for the metric as well as for the definitions of the Christoffel symbol, 
Riemann tensor, Ricci tensor etc. Consequently, the sign of the term 
on the right hand side of the Einstein equation (7.121) depends on 
the convention used to define various quantities. However, consis- 
tency requires that in the weak field limit they reduce to the Poisson 
equation (7.79). 


7.6 Cosmological constant 


Let us note here that the Einstein tensor G*” is not the only tensor 
that can be constructed from the metric tensor and its first and 
second derivatives which is conserved. In fact, it is clear that one 
can write a modified equation 


1 
RY — 5 gt” R— Agt’ = 8nGyT"”, (7.122) 


where A is a constant known as the cosmological constant and since 
g'” is constant under covariant differentiation (see (4.103)), this term 
does not violate any consistency condition. However, physically one 
can think of this term as a constant gravitational force acting on 
all particles. Namely, we can take it to the right hand side of the 
equation and absorb it into the definition of T¥”. Then in the flat 
space-time limit, it would correspond to a universal constant force 
acting on all particles. This term is known as the cosmological con- 
stant term and historically Einstein had introduced this term into the 
field equations for gravity in order to obtain a static solution of the 
field equations. However, we note here that if this modified equation 
were the true equation, then it has to reduce to the right Newtonian 
limit. Since Newton’s equation works so well in the classical domain, 
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this puts a limit on the magnitude of the cosmological constant A. 
In fact, just looking at the equation, we note that since the metric is 
dimensionless, the canonical dimension of A must be (recall that the 
curvature involves second order derivatives of the metric) 


[A] = [Z]-?. (7.123) 


Therefore, its value must be related to a characteristic length in the 
problem. The only meaningful characteristic length in the study of 
the universe is its radius. The observational measurements give a 
value for this radius to be 


Radius > 10!° light-yrs 


d 


10° x (3 x 10'° ems/sec) x (3 x 10’sec) 
= 9x10?" cms ~ 10” cms. (7.124) 


(The present observed value for the age of the universe is 13.7 billion 
years leading to a radius of 78 billion light-years.) The theoreti- 
cal limit on the cosmological constant can then be written as and 
compared with the observed value 


Nees =10-" en? = 107 eV, 


(Radius)? 
Aobs & 1078" (eV)?. (7.125) 


This simple argument gives the right order of magnitude for the 
cosmological constant. One of the burning questions in physics today 
is why the cosmological constant is so small and how to derive this 
within the context of physical theories. 


7.7 Initial value problem 


Let us now look at the structure of Einstein’s equations. Although 
what we say is true in the presence of matter, for simplicity we will 
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restrict only to the case of the gravitational field equations in empty 
space (7.97) 


Gu (9) = Rw (g) = 9. (7.126) 


This is a tensor equation and there are ten equations involving the 
ten components of the metric tensor. Furthermore, this involves the 
first and the second order derivatives of the metric tensor as well as 
the metric tensor itself. However, unlike other simple physical equa- 
tions which we come across in physics, here the equation is highly 
nonlinear. That is although the second order derivative terms are 
linear, the first order derivative terms are nonlinear. Therefore, the 
superposition principle which is so dear to us becomes inapplicable 
here. 

The Cauchy initial value problem also suffers from difficulties. 
Since this is a second order differential equation, if we give the value 


of the metric and its first derivative at an initial time x9, namely, if 
0\ _ O>uw 
Iu (X, & ) = fw (x), Ot _ Pie (3), (7.127) 


0 


are known, then we would expect that we can determine the metric 
components at all later times uniquely. However, in the case of Ein- 
stein’s equation this is not possible. This can be seen simply from 
the fact that although the equation 


GH’ = 0, (7.128) 
seems like ten equations, the Einstein tensor satisfies the divergence 
free condition (7.55) 


DG” =0. (7.129) 


These are four constraint equations and hence the Einstein equation 
represents truly six independent equations. The situation here is 
analogous to Maxwell’s equations 


OFM = J”, (7.130) 
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which define a set of four equations and involve second order deriva- 
tives of the vector potential. Thus we expect that if the vector po- 
tential A,,(x) and its time derivative A,,(x) are given at some initial 
time x°, then we can determine the value of the vector potential 
uniquely at any later time. However, as we know this is not true. 
This is because, there exist identities of the form 


0,0,F™ = d,J" =0. (7.131) 


Hence Maxwell’s equations truly represent three independent equa- 
tions. The reason for the existence of constraints in the case of 
Maxwell’s equations is understood as the consequence of gauge in- 
variance in this theory. Namely, under 


Ay > Ay, + 0,a(2), (7.132) 


Maxwell’s equations remain invariant. This also explains why the 
Cauchy initial value problem cannot be solved uniquely in the case 
of Maxwell’s theory because, if we determine one solution which sat- 
isfies the initial conditions, we can always make a gauge transfor- 
mation (with the parameter of the transformation vanishing at the 
initial time) to obtain a different solution so that the solution cannot 
be unique. In this case, we know that the initial value problem can 
be solved uniquely only after choosing a gauge condition. A pop- 
ular gauge condition which is commonly used in Maxwell’s theory 
(respecting relativistic invariance) is the Landau gauge condition, 
namely, 


0,,A" = 0. (7.133) 


That we can always bring the vector potential to satisfy this condition 
can be seen as follows. Let us assume that 


0, AY #0. (7.134) 


In this case, we can make a gauge transformation 


226 7 CURVATURE TENSOR AND EINSTEIN’S EQUATION 


Ay + Al, = Aw + duc, (7.135) 


such that 


0, A" = 0, (AY + a(x)) =0, 


or, a(x) = —0,A*(x), 


Ot “en = [ate Glare OAM"), (7.136) 


where the Green’s function G(x — 2’) satisfies 


eo —s')=-S(2—2’). (F137) 


Therefore, we see that there always exists a gauge transformation 
which would bring the vector field to satisfy the Landau gauge con- 
dition. 

The situation is quite similar in the case of Einstein’s equation. 
The presence of the constraints can be traced to the fact that the 
equations are covariant under a general coordinate transformation, 


xh — a'H (2). (7.138) 


This again shows the difficulty in solving the Cauchy initial value 
problem uniquely. Namely, if we determine a metric tensor g,,(x) 
which satisfies the initial conditions and solves the equation, we can 
always make a coordinate change to obtain another solution (still 
maintaining the initial conditions). Therefore, as in Maxwell’s theory 
we have to choose a coordinate condition in order to solve the Cauchy 
initial value problem uniquely. 

A particularly convenient choice of the coordinate system is 
obtained by imposing 


Peg 0. (7.139) 


We note that if [4 ¢ 0, then under a coordinate transformation 
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xh + x(a), (7.140) 


we have (see (4.22)) 


Ir o T 2 tA @ f 
1 Oa" Oa? OaT py , Oral” Oa? Oa (7.141) 
M Ox Oxl# Oalv On On Oe! Oc” 
Requiring that 
ina = GG ee = 0, (7.142) 
we obtain 
Mw ( »f Ox'* Ax? OxT Ox" Ax? Oat 
sma t+ Savant Bat Bar| = 
Ox Oxl Oalv ORO EF 00 
Ox! OT og OT 
or, Bae 9 (x)V'5, (a) + arr aat 4 (a) =0, 
Ox! Px! 
TP Or —— 
Or, axP (2) + g (x) Ox? Out 0, 
or, ot T°) = —g?" (e) Orn (7.143) 


Ox? OxT 


This shows that if [? # 0, then we can always find a coordinate 
transformation which will bring it to satisfy this condition. 
To see what this condition means, let us note that 


i = ve, 
= gt (= P(g + Odo — Opdur) 
= G 5) g ugup VIpp pI uv 
= =x gg (9 ) ) 
oven gh’ g P( uGup + Ov9pp — aGins) 


1 
= -9? gt” Ongup + 5 Gg Ongiu 
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Ap py 1 Ap gly 
= —g (—(Oug \on3) + 5 g°g Oo uv 


1 
= dy! + Ta Op(V—9) 9" 
1 


= ea OnlW—9 me (7.144) 


where we have used (4.148). Therefore, the condition (7.139) can be 
written as 


P* = a,(V=9 gt) =0, (7.145) 


which is quite similar to the Landau gauge. This is known as the 
harmonic coordinate condition (harmonic gauge). The reason for 
this nomenclature is that if this is true, then we have 


o(z) = D,D*4(x) = Dy(P9(2)) 
= = Ax(V=G P(e) 
= srs (V=99"2,0(0)) 
1 


=  [anlv=a oP) 0(e) + VB ADA) 


= 0°0¢(z). (7.146) 


Therefore, if a scalar function satisfies 


O,o(x) = 0, (7.147) 


then with this gauge condition it also satisfies 


o(x) = D,D*$(x) = 0. (7.148) 


Functions satisfying this condition are known as harmonic func- 
tions. In particular it is easy to see that with this condition, the 
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coordinates themselves become harmonic functions. To see this note 
that although the coordinates carry a vector index, they are really 
not vectors under a general coordinate transformation. In fact 


ch + g/h = g(a), (7.149) 
can be thought of as defining four functional scalar relations. Thus 


under a general coordinate transformation, the coordinates can be 
thought of as scalars. In the harmonic gauge, this leads to 


a! = Dy Da" = &dy2" = 0. (7.150) 


Therefore, with this gauge condition the coordinates, themselves, 
become harmonic functions and hence the condition 


r= 0, (7.151) 


is referred to as the harmonic coordinate condition. 


7.8 Einstein’s equation from an action 


Let us now ask whether we can obtain Einstein’s equation (7.121) 
from a variational principle. To do this, of course, we have to define 
an action for the theory. And to simplify, we will restrict ourselves 
only to the matter free case first. The method generalizes easily to 
incorporate matter which we also discuss afterwards. 

Let us note that the action has to be a scalar under a coordinate 
transformation. We have already seen that (see discussion around 
(3.35)) 


jee —9, (7.152) 


represents an invariant volume element in a curved manifold. There- 
fore, we simply have to look for a scalar Lagrangian density (of weight 
zero) formed out of gravitational field (metric). Furthermore, if we 
restrict ourselves only to quantities involving at most second order 
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derivatives of the dynamical variable (as is conventional in dynamical 
theories), then the choice is unique (this excludes terms of the form, 
say Ry, RY”, R? etc.) 


Ly=6R= Bg Rw, (7.153) 


where £ is a normalization constant to be determined. Therefore, we 
can write the invariant action in the form 


Sz = [es V-glr= 6 f ae v=o 9" Rive (7.154) 


Let us now define the problem of variation in the case of a grav- 
itational theory. The metric tensor in this case is the fundamental 
dynamical variable and plays the role analogous to the trajectory of 
a particle in classical mechanics. The question, therefore, is what is 
the dynamical equation satisfied by the metric tensor which would 
leave the Einstein action (7.154) unchanged if we change the metric 
infinitesimally. Namely, under the infinitesimal variation 


Guv > Gov + O9ur, 

JM Gy > ws 

Jim Oduv — 0, (7.155) 
we would like to know the condition (equation) under which the 
Einstein action would be stationary. To determine the variation of 


the action (7.154) we need various relations which we collect below. 
First, let us note that 


og qux = oN, 
or, 6g?” guy + gM” Ogu, = 0, 
or, dg’ = —g 5gg,q”. (7.156) 


Furthermore (see also (4.148)), 
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1W/=G): = 5 8) 


1 Tr In gyv 
aaile 2 
(—g) LL 
=> “Ogy 
ay—go “SH 


= ‘= gt” Suv (7.157) 


Let us also note from (4.49) that 


1 
Bae = ~95 G°° (OpGue + OvJou = Ox Oia) (7.158) 


which leads to 


1 
oy = =) 69°" (Ou Gve + OVGou 7 OoGyw) 


1 
~ 5 ge (O Oa yee + OVOGopu = Se Ou): (7.159) 


and we will analyze the two terms on the right hand side separately. 
Using (7.156), the first term leads to 


1 
3 9 5 G0p9°° (Ougue + Onder = OeGuix) = — 9°" 6gopV',,. (7.160) 


To simplify the second term in (7.159), let us use the identities 


Dy 8Guc = On09ve ay Ty 5980 a Tio 59uB: 
Dy 5Gon = Ov6Gon +1 Y, 59a +13, 5908; 
D509 ge = “O50G nna 14,096 + To ogy, (7.161) 


which lead to 
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CD 0 Gna Dp bGai: Ded dun) 
= (0,590 + O6Gou — Oo5Guv) + 209,69 60: (7.162) 


It follows from this that 


(0.090 oT OvOGou a O06 Gu) 


= (Dy 69v0 + Dyd59on — DoS Guv) — 20,5980. (7.163) 
Consequently, the second term in (7.159) gives 


1 o 
=5 9 (DydGv0 + Di5Gon ~ DodGu ~ ViydGae ) 


1 
= =) oo (D8 9ve ae Di OGopu = DedGie)* 9 SgagV v- (7.164) 


Adding (7.160) and (7.164) we obtain 


1 
Os = ~95 9° (D6 Guo sr DiOGou = Dsdguv), (7.165) 


which shows that the variation of the connection (Christoffel symbol) 
is a tensor as we have noted earlier. 

We are now ready to calculate the change in the Ricci tensor. 
We know from (5.18) that the Riemann curvature tensor is defined 
as 


Bess = Oly — Op a el Bee = Res ne (7.166) 
so that 
Rup -~ Lee 


= 0,7, -9,T%,+T#,07, —T4,T%,- (7.167) 


Therefore, we have 
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ORup = or _ O01, + Oe + eco ale 


SO i pol py 


(O,or4, mee re AAEM wal Res ieee Pe ol ye) 


— (0,61, — TH ,609, + 17,004, +17, ,50 2) 


De (Ores) — Di (OFF) 


or, dORyp 


D, (or’,) Dy (or3,) (7.168) 


This is known as the Palatini identity. Let us note further that the 
Ricci scalar is given by (7.49) 


R=g’ Rw, (7.169) 


which leads to 


6R = Og Ruy + gf ORw 
= —gg?’SgyoRuw + gh” (D. (ora,) =5 (r3,)) 


= —59,,RY” + Dy (st”or,) Dy (s”or,) 


1 
= + =. py = py spr 
= OI R Ser 0, (v 99 or, ) 
1 
4 SS = BY spr 
=k (V=9 9 ors.) (7.170) 


where we have used (7.156), (7.165) and (7.168). 


Therefore, we finally obtain the variation of the action (7.154) 
to be 
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OS 5 


gf ate ( d‘z (6(\/—g)R + V—g 5R) 
= 6 fae ns V=G 9!" FquR 


+V=5{- O9guy RY +a (V=9 gt”or ») 


1 r 
~ V=9 O» (v —g9 of” 50, ) \) 7 (7-171) 
We note that the last two terms in (7.171) are total divergences and 
if we assume the connections to vanish at infinity, then we can ignore 
the last two terms and rearrange the other terms to write the change 
in the action as 


6s g f ate yr (5 gt” R— ne) 59uv- (7.172) 


Since we require that the action be stationary for arbitrary infinites- 
imal variations of the metric 


5Sp =0, (7.173) 


this leads to 


; gt” R— RM = 0, 


1 
or, Re — 5 ge h=G=0, (7.174) 


This is the correct equation for empty space (7.97) and we have 
derived it from an action through a variational approach. Note that 
the normalization constant 6 remains undetermined at this point and 
gets determined only when matter is included. 

In the presence of matter, we can generalize the action (7.154) 
to 
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S = Sr + Smatter 
= 6 f ate V=gR+ f de Y= Lannie (7.175) 


If we now look for a variation of the complete action in (7.175), we 
can write 


6S g fate V9 99 uv (5 gg’ R- ne) 


1 y 
+ fats vg (-5) eee ate 


1 
[ue Vg 09 E (Sue _ ne) = Tite : (7.176) 


Here, although we have not yet derived, we are using the fact that 
we can write the change in the action for matter as 


il 
OS wiather = 9 [as V—-g lod Magee (LAT?) 


In fact, this is quite plausible recalling the fact that, in the case of 
charged particles coupled to the Maxwell field, the variation of the 
matter action (under an infinitesimal change of the gauge field) can 
be written as 


6 Simatter = / d*z 5A, J*, (7.178) 


where J" represents the source for the Maxwell theory. Clearly if 
the action (7.175) is to be stationary for an arbitrary infinitesimal 
variations of the metric, namely, 


55 =0, (7.179) 


then it follows from (7.176) that 
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: HY RR REY : ee =0 
B 9 g as a 9 matter — “> 


1 1 
or, Rw 5 gaa = ay Y Baca (7.180) 


Comparing this with Einstein’s equation (7.121) 


1 
GY = RY 5 Gg R= SaGule tee (7.181) 


we determine 


1 


B = 
This determines the normalization constant uniquely and thus we 
can write the Einstein Lagrangian density (7.153) for the matter free 
case as 


1 
Le “Tone R. (7.183) 
Written as an action the analogy with the Maxwell theory is obvious. 
The analogous gauge invariance in this case is the invariance under 
a general (local) coordinate transformation. Let us also note that 
a cosmological constant can also be incorporated into the Einstein 
action (Lagrangian density) simply as 


1 


fa 167Gy 


(R+A). (7.184) 


CHAPTER 8 


Schwarzschild solution 


As a simple application of Einstein’s equations, let us determine the 
gravitational field (metric) of a static, spherically symmetric star. 
Many stars conform to this condition. There are also many others 
that behave differently. For example, a star may have asymmetries 
associated with it, it may be rotating or it may be pulsating. How- 
ever, the static, spherically symmetric star is a simple example for 
which the metric can be solved exactly. Therefore, it leads to theo- 
retical predictions which can be verified as tests of general relativity. 


8.1 Line element 


Although Einstein’s equations are highly nonlinear, the reason why 
we can solve them for a static, spherically symmetric star is that the 
symmetry present in the problem restricts the form of the solution 
greatly. For example, since the gravitating mass (source) is static, 
the metric components would be independent of time. Furthermore, 
the spherical symmetry of the problem requires that the components 
of the metric can depend only on the radial coordinate r. Let us recall 
that in spherical coordinates, the flat space-time can be characterized 
by the line element 


dr? = dt? — (dr? + r?(d6? + sin? 6d¢’)). (8.1) 
We can generalize this line element to a static, isotropic curved space 
as 


dr? = A(r)dt? — (B(r)dr? + C(r)r?d0? + D(r)r? sin? 6d¢g?). (8.2) 


The following assumptions have gone into writing the line element in 
this form. First of all since the metric components are independent 
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of time, the line element should be invariant if we let dt + —dt. This 
implies that linear terms in dt cannot occur. Isotropy similarly tells 
that if we let dd + —dé@ or d@ > —d¢, the line element should be 
invariant. Thus terms of the form drdé, drd¢ or d@d@¢ cannot occur 
either. This restricts the form of the metric to be diagonal. 

Let us now look at the line element (8.2) at a fixed time and 
radius. At the north pole (dé = 0) with € = rd6, we have 


dr = —Clr)e’. (8.3) 
On the other hand, if we look at the line element in the same slice 
of space-time but at the equator (@ = 4) with e = rd¢, then 

dr? = —D(r)e?. (8.4) 
However, if the space is isotropic then these two lengths must be 
equal which requires 

C(r) = D(r). (8.5) 


Thus we can write the line element (8.2) as 


dr? = A(r)dt? — B(r)dr? — C(r)r?(d6? + sin? @d¢?). (8.6) 
There is a simpler way of understanding this result. The spherically 
symmetric line element (8.1) 

dr? = dt? — dr? — r7d?, (8.7) 
with the angular element 

dO? = (d6? + sin? 6d¢”), (8.8) 
generalizes to a static, spherically symmetric curved space as 


dr? = A(r)dt? — B(r)dr? — C(r)r?dQ?. (8.9) 
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We note here that the function C(r) in (8.6) is redundant in the 
sense that it can be scaled away. Namely, if we let 


r3F=([C(r))2 1, (8.10) 


then 


dé = dr|(C(r))2 + 


or; dr = FH )dr, (8.11) 


where we have identified (prime denotes a derivative with respect to 
y) 


i 


Gy yea (8.12) 
Let us also define 
A(r) = A(#),  B(r)dr? = B(#)d?”. (8.13) 


Thus with this scaling the form of the line element (8.6) becomes 


dr? = A(#)dt? — B(#)dF — F°(d6? + sin? 6d¢?). (8.14) 


This shows that with a proper choice of the coordinate system the 
line element for a static, spherically symmetric gravitational field can 
be written as 


dr? = A(r)dt? — B(r)dr? — r?(d6? + sin? 6d¢7), (8.15) 


which is known as the general Schwarzschild line element. 

Birkhoff has demonstrated that the requirement of a static grav- 
itational potential is superfluous for the Schwarzschild line element. 
In fact, he has shown that even when a spherically symmetric star or 
a gravitational mass is undergoing a radial motion, the line element 
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can be written in the Schwarzschild form (8.15). This is known as 
Birkhoff’s theorem and is important in the sense that it rules out 
the possibility of gravitational waves being emitted by radial pul- 
sars since the metric in this case can be equivalently thought of as 
static. Therefore, to be able to emit gravitational waves, if such a 
phenomenon exists, a star must undergo different deformations. 


8.2 Connection 


As we see now, the Schwarzschild line element (8.15) is given in terms 
of two unknown functions A(r) and B(r). The metric components 
can be read off from the line element (8.15) to be 


goo = gu =Alr), 

91 = Grr=—Bl(r), 

922 = Goo = —r, 

933 = Jos = —T sin” 0. (8.16) 


This is a diagonal metric and hence the nontrivial components of the 
inverse metric can also be easily written down as 


1 
00 _ te 
g —- g A(r)’ 
1 
TS og: aT, 
g —< g B(r)’ 
1 
a 00 eae tes 
g ~~ 2? 
OC 2 = (8.17) 
r2 sin? 6 


We can solve Einstein’s equations far away from the star to 
determine the forms of the functions A(r), B(r). That is, outside 
the star we can solve the empty space equation (7.97) 


Rwy = 0, (8.18) 
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subject to the boundary condition that infinitely far away from the 
star, the metric reduces to the Minkowski form (8.1). To solve 
Einstein’s equations we must, of course, calculate the connections 
and the curvature tensor. For example, from the definition of the 
Christoffel symbol (4.49) we have 


1 
a = =) g? (OvGdp +r ONIpv = OpGu2)s (8.19) 


and since we know the metric components, these can be calculated. 
But this method is tedious (as we have pointed out earlier) and hence 
let us try to determine the components of the connection from the 
geodesic equation which has the form (see (5.130)) 


dag. da* da? 
Juv ae Ba 5 (O.9pu ae OpGur = Ou9p) ae ae = 0. (8.20) 


For « = 0 (t-equation), this leads to 


d2z° 1 da* dx? 
900 a + 5 5 (PrGp0 + Op90 — P09Xp) eee a 0, 
d?x° dx® da! 
or, A(r) dr “72 Ts 1900 —— ae ‘Cae = 0, 
d?z®° = Al(r) dz® dx} 
Sy poac ee 21 
a dr? A(r) dr dr ie 
Comparing this with the 4 = 0 equation of (see (5.83)) 
dat! da* dx? 
cs [ee . 
dr2 Ap dr dr 0, (8 22) 
we determine 
1 A’(r) 
SS 
O01 10 9 A(r)’ (8 23) 


Here prime denotes differentiation with respect to r. 
For ps = 1 (r-equation), from (8.20) we obtain 
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Aeyas 4 da* dx? 
gu a + 3 (Ongp1 + Apgar — A929) ‘dr dr ®, 


d?x!} dx® 1 da! 
—B 
dr2 2 5 O19 (= dr ) 3 O19 (= dr ) 


dx? 1 da? a 
et = 8 = 6 — |) =0 
1922 (= ae ie 5) 1933 ( as ) ’ 


deg). , (ax \? 1 ,(dzl\? 
oe, ota) ee, 


or, 


d2x! 


1 
a dr2 2 


A’ (dx® ae B' (da!\? 
B\ dr 2 B\ dr 
3\ 2 
-3(S aes sn? (=) =0. (8.24) 


Thus, comparing with the 4 = 1 equation of (8.22), we obtain 


' = 1 B 
To = 3 4s, Thy = ~9 BR: (8.25) 
7 2 : 
Th = & rh = Re 
For = 2 (6-equation), equation (8.20) leads to 
d?x? 1 da* dx? 
922 Ga +5 (Oy gp2 + Opg2x — 029rxp)—— i. ae = 0, 
d2x? da! dx? 1 dx? \? 
2 = 
or, rapt 1922 ae eT 8 02.933 ( ae ) = 0, 
d2x? da! dx? da? \? 
2 = 
or, 7 2r — a +r? sin @ cos 6 (=) = 0, 
d?a?_ 2 da! da? dx3\? 
—>, +- — —-— sind 6{ —)] =0. 8.26 
pa de 'r dd ( dr ) (8:20) 
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This determines 


1 
ee = nee ee oe = sin @ cos 6. (8.27) 


Similarly for 4 = 3 (¢-equation), we obtain from (8.20) 


933 see + 5 (9,993 + Op93r — 039rp) a — = 0, 
or, —r? sin? 0 ce + ee ua + 02933 ee = = 0, 
or, —r? sin? 6 ne — 2r sin? 6 ee bia 
—2r? sin 8 cos 6 = = 0, 
or, Se Sieg =, (8.28) 
which yields 
=r. = -5, r3, =T3, = — cot. (8.29) 


Let us collect all the nontrivial components of the connection 
which are given by 


Toi = Mo = —3 re 

To = -3 Th = -3% 

th = & Th, = vaste, (8.30) 
iy = ha, T32, = sin@cos9, 


Ty. = TS [33 = 3, = —cotd. 
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8.3 Solution of the Einstein equation 


It is now easy to calculate the Ricci tensor from the connections 
(8.30). We recall that the Riemann-Christoffel curvature tensor 
(5.18) is defined to be 


ine = OT a OV hy T ee nae ie (8.31) 
so that 
d 
Ruy — OF ane 
= 0,0, — 0, +1,F%, —03,0%y. (8.32) 


It follows now that 


Roo = O19, — OF Go + TRG — Tol Sa 


—A:T 9 + TRP 00 — Tol Go — Tol Gi 


= -0:T G0 (To. tT, +13, + T3,) V0 — Vil bo — Pool 1 


| 
| 
Q 
fi 
ah 
on 
5 
| 


F (Io, +1}, +13, + T3)) Too 


1 O A’ 1A’ 1B’ 1 1 1 A’ 
2a \B/T\2 a4 2B + 7) 2B 
1 0 A! 1A’ f/1 A’ 1 zB’ 2 
-5m (3)-35 GG-a5-3) ees) 


so that the 00-component of (8.18) leads to 


sie ace A IAN DA AA eB” 29 Ne A 
a LA 2 BO py 


or, 


A” 1 A! A!’ B' A! 
ss-iz (ats) 
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Similarly, we have 


Ry = ATH - AT} +P3,Pq -Th0H 


= O (Yo tT, +i 4 Ts) aT 


40h (Tor +1}, +13, + T31) 


0 pO 1 pl 2 p2 3 p3 
Polio > Ply _ Pyol {2 _ Pi3l\3 


Ba Na BN (ea 22 
Or a ae 2B 2A 


Pb fANA. Ce. oD 
“4 VA) 72 PP 


A” 


A” 1 A! A! B' B' 
= "3A A (=) ($+) EB: 


Thus, the 11-component of (8.18) leads to 


Ry, = 0, 
A” 1 A’ A’ B' B' 
" 24°79 (=) (4+5) “oR 


We also have 


Rog = O20, — O03, + 14,13 — 13,53, 


= 0133 — ATS. + Td, (Tb. +1}, +13, + 13) 


1 p2 2 pl 3 p3 
= PoP = P31 P59 ~, P3193 


ANS 5 2c A'\ (B' 
A rr ANVAI\VB 
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(8.35) 


(8.36) 
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= cosec’s — 2 (5) + (5) (-3 4-5 5+7-3) 


— cot? 6 
me 1B lr A’ B 
7 B B2 2B\A_ B 
1 r A’ B’ 


Thus, the 22-component of (8.18) yields 


Ro = 0, 

1 r A B 

ie, mes ip ees, Wh ese 
on, Zt+55 (5 a) (8.38) 


BA 
“7B pe 
A’ B' 
oe A Be 
or, AB=constant =k. (8.39) 


If we now impose the boundary condition that at large distances the 
metric becomes Minkowskian (see (8.1)), we have 


lm A(r) —> 1, 


TCO 
lm Bir) - 1. (8.40) 


Too 


This therefore, determines the constant of integration in (8.39) to be 


k=1, (8.41) 


and we have 
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A(r)B(r) = 1, 
=z (8.42) 
or, AG 
If we now substitute this relation into (8.38), we obtain 
rA (A A’ 
AG EES 45 
or, A(r)+rA'(r) =1, 
eA ss 1, 
dr 
or, rA(r)=r+constant =r+m, (8.43) 
so that 
AQ ep 
Ze 
1 my\—1 
Br) = ay = ( i =) (8.44) 


Here m is a constant of integration to be determined. 
We can now write down the Schwarzschild line element (8.15) 
in the form 


<i 
a (1 + a) dt2— (1 as =) dr?—r?(d0?-+sin? 0d@2). (8.45) 


Let us emphasize here that there are ten equations of Einstein 


Rwy = 0, (8.46) 


and we have used only three of them to determine the form of the 
Schwarzschild line element. Therefore, it remains to be shown that 
the other seven equations are consistent with the solution in (8.45). 
(Namely, since there are ten equations involving two unknown func- 
tions, it is an overdetermined system and we should check for the 
compatibility of the solution.) In fact it can be easily shown that 
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Rw = 0, for pFy, 
R33 = sin? 6 Roo = 0, (8.47) 


so that all the ten equations are consistent with the line element 
(8.45). 

To determine the constant of integration m, let us note that 
very far away from a star of mass M we have seen that the metric 
has the form (see (6.21)) 


2Gy M 
goo = 14+ 2¢(r) =1— : ; (8.48) 


where M denotes the mass of the star. Comparing this with the 
solution in (8.45) we determine the constant of integration to be 


m = —2G,M, (8.49) 


so that the Schwarzschild line element (8.45) takes the final form 


= 4 
seein (: 7 ae dt2— ( “s ae dr?—r?(d62-+sin? 0d¢2). 
(8.50) 


This determines the form of the line element and, therefore, the 
metric uniquely. 

One striking feature of the Schwarzschild metric in (8.50) is that 
at r = 2G, M, 


goo =9, Grr > ©. (8.51) 
That is, the Schwarzschild metric is singular at the Schwarzschild 
radius defined by 


Ts = 2GyM. (8.52) 


For most objects, this radius lies inside the object. For example, 
since (in order to have the appropriate dimension, the actual constant 
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appearing in (8.50) (or (8.52)) is the Newton’s constant Gy defined 
in (7.80) divided by c? which we have set to unity in our convention, 
but which can now be restored for calculations) 


Gy ~ 7 x 1077? cm gm7}, 


M (earth) ~ 6 x 104 kg = 6 x 107’ em, (8.53) 
the Schwarzschild radius (8.52) for earth has the value 
rs(earth) = 2GyM(earth) ~ 2x 7x 1077? x 6 x 107” cm 
= .84cm, (8.54) 


which is well inside the earth (r(earth) ~ 6 x 10°m ~ 6 x 10° km). 
For the sun, we have 


M(sun) ~ 2 x 10°° kg = 2 x 10° gm, (8.55) 
so that 
rg(sun) = 2GyM(sun) ~ 2x 7x 10-7 x 2 x 10°35 cm 
= 28x 10°cm ~3 km. (8.56) 


We can compare this with the radius of the sun which is given by 
Tsun % 7 X 10° km. (8.57) 
Let us, for the sake of curiosity, calculate the Schwarzschild radius 


for the proton. We know that 


m(proton) ~ 1.6 x 10774 gm, (8.58) 


and, therefore, 


rs(proton) ~ 2x 7x 107-7? x 1.6 x 107% cm 


I 


2.2 x 107°? cm. (8.59) 
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We can compare this with the radius of proton which has the value 


r(proton) ~ 1.2 x 107!8 em. (8.60) 


Thus we see that for most objects the Schwarzschild radius indeed lies 
inside the object which is, of course, beyond the region of validity 
of the solution (which holds true only in the exterior of the star). 
Therefore, we can ignore this singularity in the metric in such cases 
since the solution in the interior of the star is given by a different set 
of equations. 

For some other astronomical objects like the neutron stars, how- 
ever, the Schwarzschild radius is quite close to the physical radius 
of the object. In this case, we cannot neglect the singularity in the 
metric. However, the curvature tensor in this case can be shown to 
be singularity free. Thus the apparent singularity in the metric is an 
artifact of our choice of coordinates and can be rotated away by a 
suitable choice of coordinates. We will see this later. However, we 
note that if we compress matter into a star hard enough so that its 
physical radius is smaller than its Schwarzschild radius, then we ob- 
tain a black hole which we will also discuss later. The Schwarzschild 
radius is often referred to as the event horizon for reasons that will 
become clear shortly. We note that if r << rg = 2GyM, i.e., when 
the Schwarzschild radius is larger than the size of the object and we 
go inside the Schwarzschild radius, then 


go <9, gi > 0. (8.61) 


Namely, the time and the radial coordinates seem to exchange their 
roles. This leads to a lot of interesting phenomena. Let us emphasize 
here that the Schwarzschild solution does have a genuine singularity 
at the origin r = 0 when the star is a point mass object. This is not 
surprising since such a singularity occurs even in classical gravity. 


> Example (Schwarzschild solution in the presence of a cosmological constant). 
As we have seen in (8.50), the Schwarzschild line element has the form 


dr? = A(r)dt? — B(r)dr? — r?dQ? 


= 
= (1 = a dt? — (1 = ae dr? — ra. (8.62) 
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We obtained the line element by solving the Einstein’s equations away from the 
star where 


Rw =0, pv =0,1,2,3, (8.63) 


for only the components Roo, Rii, and R22. Since there are 10 independent 
components of Rv, we still have to show that the line element (8.62) also solves 
the remaining seven equations 


R33 = Roi = Roz = Ros = Riz = Ris = Roz = 0. (8.64) 


To proceed, we recall that the components of the connection have been 
determined in (8.30) to have the form (see also the determination of A,B in 
(8.44) and (8.49)) 


re, =r? ey ioe _ Gri 
s - 2A r2 —2GyMr’ 
pA GxM (,_ 2GxM 
pe 2B r r , 
po BGM _ 
ve 2B r2 —2GNMr°’ 
Pp = B= 7 2GuM, 

1 
Tie =Ta = —<, 
: 1 
lis =Ta = — =, 


T33 = (r — 2GnM)sin? @, 
T2, = sin @cos0, 


rs, =P, = — cot. (8.65) 


Furthermore, from the definition of the Ricci curvature tensor in (8.32) 


Ry S01 Oly PP oh pS Pek (8.66) 


we obtain 
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R33 = 0303) — 0133 + VX6033 — 13503) 
= —Asin? 6 — rA' sin? 6 — cos 20 + sA'r sin? 0 — Asin? 0 
+ 5 sin 20 cot 6 + Asin? 6 — ; sin? Or A’ 
= — Asin’ 6 — (1— A)sin? 6 — 1+ 2sin? 6 + cos? 6 


. 2 2 2 2 2 
= —sin® @—1+sin* 6+sin* 6+ cos’ 0 


=0, 


Roi = Oo1}, — 0 to + TAT fo — POF ha 
=PRoG1 — Mol G0 — PPO 
=0, 


Roz = oT 9 AT oo PRT oe = 13.08, 


= 0, 


Ro3 = O30, — AT 03 + PX00$3 — F300 ba 


= 0, 


Ri = OT}, — ATH + PR %o -— THT Ia 


3 p2 1 pl 2 p2 3 p3 0 po 
= T3202 Tain To2T 2 T30\3 Toto 


= 0, 


Rig = Ost, — AT 43 + THoM%3 — THES 
=0, 


Rog = O33) — O4093 + TRoV93 — P30F 2. 


=0. (8.67) 


Here we have used the fact that the connections are independent of the azimuthal 
angle ¢@ and that only some of the connections depend on @. This shows that the 
line element (8.62) also solves the remaining seven Einstein’s equations. 

In the presence of a cosmological constant, Einstein’s equations (7.122) (see 
also (7.121)) become 


1 
Ruv = 87rGn (Tv a 59T) + Aguv, (8.68) 


where A denotes the cosmological constant. Far away from the star, we have 
T'’ =T =0, and (8.68) takes the form 


8.3 SOLUTION OF THE EINSTEIN EQUATION 253 


Ruv = Agur- (8.69) 


For the spherically symmetric line element (8.15), we have already deter- 
mined the Ricci curvature tensors and the relevant equations in this case, take 
the forms (compare with (8.34), (8.36) and (8.38) respectively) 


A” A’ A’ B' A’ 
Roo = 55 (4+5)+3-a4 


2B 4B\A' B rB 
Al A’ A’ B' B' 
1 rT A’ B’ 2 
Ra =1- 5-55 (4-3) =-ar (8.70) 


It follows from (8.70) that 


B 1/A’ B’ 
GZ Foo + Ria =- (= =) =0, 
r 


d 
or, a. In(AB) = 0, (8.71) 


which determines 


A(r)B(r) = constant = 1, (8.72) 


as before. Using this in the R22 equation, we obtain 


AG) A 
1— A(r) — rA(r) x Alr) —Ar’, 
d 7 2 
or, ql") =1+ Ar’, 
or, A(r)=1+ ahr + =, (8.73) 


where m is an integration constant. It follows now from (8.72) that 


Bi) = aw 2 (: a: gar’ + my (8.74) 


As in the case of the absence of the cosmological constant, we can determine 
m = —2GyM, so that far away from the star we get the familiar solution goo = 
1— oon Mt (see (8.49)), if A is extremely small (as is normally the case). 

The Schwarzschild line element then becomes 
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4 
ar? = (: 4p zAr® 7 ae dt? — (1 + sar" = oo dr? — 72a Q?. 
(8.75) 


< 


> Example (3-dimensional space of constant curvature). A space is said to be 
of constant curvature if the Riemann-Christoffel curvature can be written in the 
form 


Rywrp = K (Gur9ve — GueGvra) ; (8.76) 


where K denotes the constant curvature of the space (see also (7.75)). 
Let us consider an isotropic 3-dimensional space of constant curvature K. 
For an isotropic space we can write the line element as 


dr? = A(r)dr? + r7dQ? = A(r)dr? + r?(d0? + sin? 6 dd’). (8.77) 


The nontrivial components of the metric tensor for this space are given by 


gui =A(r), go2=r7’, g33 =r’ sin 0, (8.78) 


and the Lagrangian for the geodesic, in this case, is given by (a dot denotes a 
derivative with respect to 7) 


L = (Ar? +1776? +r? sin? 647). (8.79) 


Nle 


From the Euler-Lagrange equations we can determine the affine connections as 
in section 8.2 and the nontrivial components have the forms 


Tai = ae 

ln =, 

[33 = 5 sin? 0, 

Peer oes 
Tr 

ri =1,=-¢, 


13, = sin 0 cos 0, (8.80) 


V3, =T3, = —cot 0. (8.81) 
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We can now compute the components of the Riemann-Christoffel curvature 
tensor 


Ruvrp = Guo (OoT2y ~ Clee + Py aten _ Tp): (8.82) 


For example, for u = A = 1,v = p = 2 we have 


Rio = gio (V3, — AT 32 NN V3.0 42) 


r r A’ lr 
ey Al 2 
o A aoa Aa 
rA’ 
=". (8.83) 


On the other hand, since the space is of constant curvature, we conclude from 
(8.76) that 


Rize = K (911922 — 912921) = Kgi1g22 = K Ar’. (8.84) 


Comparing (8.83) and (8.84) we obtain 


rA’ 


=, 2 
DA = KAr : 
/ 
or, 4 = 2Kr, 
1 
or, A(r) = a= hres (8.85) 


where c is a constant of integration. Requiring that the space corresponds to flat 
space (in spherical coordinates) when the curvature vanishes, namely, 


lim A(r) > 1, (8.86) 


kK—-0 


we determine the constant of integration to be c= 1 and 


1 
A — : 
(y=. (8.87) 
so that the line element (8.77) takes the form 
ft aaa (8.88) 
7 = le : ‘ 


There are three cases to consider now. For kK = 0, we get the familiar 
“flat-space” geometry described in spherical coordinates, 
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dr? = dr? +r? dn?. (8.89) 
For K > 0, and Kr? < 1 (equivalently, r < KN 
the universe) we can define 


, which is the radius of 


: 2 
— sin(VKp) age dr (8.90) 
VK 1— Kr? 
and the line element (8.88) becomes 
dr? = dp? + z sin? (VK p) dO, (8.91) 


which describes a spherical space with an oscillating radius + sin?(/Kp). (Note 


that for small Kp, the line element becomes dr? © dp” + p? dM). 
Finally, for K <0, with K = —|K]|, we can define 


sinh(,/|K|p) 2 dr? 
= —_—L_——,_ dp = —— 8.92 
JK ‘ 1+ |K|r? ee) 
and the line element takes the form 
dr? Sap? ra sinh?(./|K]p) do. (8.93) 
This describes a pseudosphere of expanding radius. 
< 


> Example (Angular momentum conservation). Given the Schwarzschild line el- 
ement (8.50), the Lagrangian for the geodesic in such a space is determined to be 
(see (6.50) and a dot denotes a derivative with respect to 7) 


L= =(Ai? — Br? + r°(6? + sin? 6¢°)), (8.94) 


NlR 


where, as we have seen in (8.44) and (8.49) 


— 2GyM 


r 


A(r) =1 = Bl(r). (8.95) 


The generalized conjugate momenta follow from (8.94) to be, 


OL _ _ OL 


p= OE = A(r)t, Pr = OF = —Bi(r)r, 
OL F OL aot 
pe= a= r?6, a ae r? sin? 06. (8.96) 
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The geodesic equations for the 0, @ components can now be written as (see (6.51)) 


dpe =r’ sin 0 cos 66”, 

dr 

dpe _ 

rr — 0. (8.97) 


Let us consider the total (orbital) angular momentum of a particle, moving 
along the geodesic, defined by 


Lox = Do + =. (8.98) 


Using (8.97) as well as the definitions of the conjugate momenta in (8.96), we 
obtain 


dpo , 2ps dpy  _2¥5 


= nig 06 
Pear ' sin?0 dr sin?@ 
,2, 2p, cos 6 
= 2po(r” sin 0 cos 0¢7) — a ee 
sin’ 0 
; : 212 sin? 64)? cos 0 6 
= 2(r70)(r? sin 8 cos 0¢”) — oe eee 
sin” 0 
=0, (8.99) 
proving that L?,, = constant. Namely, the orbital angular momentum of a 


particle moving along a Schwarzschild geodesic is conserved. 
< 
8.4 Properties of the Schwarzschild solution 


Let us try to understand the features of the Schwarzschild solution 
(8.50). It is an exact solution and the line element has the form 


=f 
Gee ( 2 a dt? -(1 ie aa dr? — r2(d6? + sin? 6d¢2). 
(8.100) 


First of all, let us note that if the mass of the star is zero (M = 
0), then the line element simply reduces to that of flat space-time 


258 8 SCHWARZSCHILD SOLUTION 


of relativity (8.1) as it should. (This is also the asymptotic form 
(space) of the Schwarzschild solution.) Consequently we can identify 
the time and spatial coordinates to be those as measured in that 
coordinate frame. In the presence of a star we can continue to identify 
these coordinates as before (namely, as observed by an asymptotic 
observer), the effect of curvature manifests mainly in the term 20 nM 
Let us also note that the Schwarzschild solution mainly investigates 
the effects of a single star in the external region without considering 
the effects of other masses in the universe. This is a reasonable 
thing to do in the vicinity of a star where the effect of other stars is 
negligible (if the other stars are far away). 

Furthermore, it is a property of static spaces that the space-time 
splits into a time direction and spatial hypersurfaces. The angular 
coordinates 0, ¢, of course, take values 0 < 6< a7 and0<@< 27 
as in flat space. The radial distance, however, is restricted to lie 
between ry <r < oo where ry, is the larger of rg and regtay. Since 
the metric is static we can assume —oo < t < oo. Let us also note 
that the static nature of the space allows us to identify the three 
dimensional spaces as fixed in time. We can, in fact, study the three 
dimensional surfaces (at any fixed time) given by 


-1 
ds? = (1 a an dr? + r?(d6? + sin? 6d¢?). (8.101) 
r 


Furthermore, if we restrict ourselves to surfaces with constant r, then 
we can write 


do” = r?(d6? + sin” d¢”). (8.102) 


This is, of course, the surface of a sphere embedded in a Euclidean 


space. The circumference at the equator (i.e., 0 = 4) is given by 


Qn 
[eo =i d@ = 2nr. (8.103) 
0 


Thus by measuring the circumference of the great circles we can in 
fact obtain the value of the radial coordinate uniquely. This is like 
in flat space. 
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However, if we look at the radial increment of the line element 
(8.101) for fixed 6 and ¢, we obtain 


oa 
ane (1-7) "des 
, 
or, dR>dr. (8.104) 


This shows that r no longer correctly measures the radial distance. 
However, we note that as we go infinitely far away from the star we 
have 


dR = dr, (8.105) 


and r does measure the radial distance in this limit. This can be 
understood on physical grounds simply as due to the effects of cur- 
vature. That is, the space becomes curved near the star such that 
any radial cross-section projects onto flat space-time as a circle of 
radius r. However, because of the curvature, the radial distance be- 
tween the projected circles which gives dr is smaller than the actual 
length between the two corresponding radial cross sections as shown 
in Fig. 8.1. Similarly we note from (8.50) that a clock at rest records 
the proper time to be 


dR 


Figure 8.1: Radial increment on a curved space compared with its 
projection on a plane. 
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al 
2GyM \ 2 
dr = (1 2 ae * dt. (8.106) 


These results are quite similar in form to the relations in special 
relativity with which we are familiar, namely, (c = 1) 


Nie 


1 
M \ 2 
a (1 - aa dt, — dtsp = (1— v*)? dt, 


si 


1 
2GyM\~3 1 
dR = (1- x ) dr, dLisy = (1— v2)? a. 


r 


(8.107) 


However, there are important differences. First of all we note that 
the Lorentz contraction or dilation in the case of special relativity is 
a constant factor whereas in the case of the Schwarzschild solution 
it is position dependent. Furthermore, the relations in the case of 
the Schwarzschild solution are coordinate dependent. If we use a 
different coordinate system, these relations would change. 


8.5 Isotropic coordinates 


In general most soluble problems in gravity have more than one sim- 
ple coordinate system where exact results can be obtained. Another 
such coordinate system for the Schwarzschild solution is the isotropic 
coordinate system. Here we would like to write the line element in 
the isotropic form 


dr? = A(r)dt? — B(r)ds?, (8.108) 


where ds? denotes the line element for the flat three dimensional 
Euclidean space. The reason behind looking for an isotropic line ele- 
ment is that the spatial element ds? is invariant under changes in the 
spatial coordinate system. Namely, we can express the space coor- 
dinates by Cartesian coordinates x, y, z or by spherical coordinates 
or by cylindrical coordinates and the line element would still have 
the same form. Furthermore, if we look at the cosine of the angle 
between two vectors in this three dimensional space 
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_ ein 
lela’ 


cos @ = (8.109) 


it is the same as in the flat Euclidean space (the metric cancels out 
in the numerator and in the denominator). That is, this coordinate 
system is angle preserving and for this reason this line element is 
often also referred to as the conformal line element. The coordinate 
system in which the line element takes this particular form is known 
as the isotropic coordinate system. 

Let us assume that there exists a coordinate system parameter- 
ized by p(r), 6 and @ such that (unlike the line element in (8.50), 
here B(r) is not the inverse of A(r)) 


dr? = A(r)dt? — B(r)ds? 


= (1 aS “ett) dt? — B(r)(dp? + p?(d6? + sin? 6d¢’)). 
(8.110) 


Comparing this with the Schwarzschild line element (8.50) 


= 
ek (1 : “on di? — (1 z oe dr?—r?(d6?-+sin? bd¢2), 
r 
(8.111) 


the angular components lead to 
r? = B(r)p?(r). (8.112) 
Similarly comparison of the radial component gives 
IGhM\~ 
B(r)dp? = (1 — on dr?, 
r 


2 —1 
or, r? ee = (1 — ao dr?, 
p 


r 
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dp dr dr 
Or, —_— = TTT — Sapa aa 
OO (1 2 2GxM) 2 (r2—2GyMr)2 
d 
ee ee (8.113) 
r= GyM)? — (GyM)? 
This can be integrated to give 
In p(r) = In Ir — GyM 4+ Vr? -2GyMr|4+C, (8.114) 


where C is a constant which can be determined from the boundary 
condition that as r > oo the line element (8.110) should correspond 
to that of the flat space-time and hence (asymptotically B(r) — 1) 


lim p(r) > 1. (8.115) 


Too 


This leads to 


lim Inp(r) > In(r+r)+C=lInr+mn2+C=Inr, 


TCO 


or, C=-—In2, (8.116) 


and determines the form of the solution in (8.114) uniquely to be 


In p(r) = In|r — GyM 4+ Vr? — 2GyMr| - In2, 


or, 2o(r) = (r — GyM) + Vr? — 2GyMr, 


Oty. PUk= l(r — GyM)4+ Vr?- 2GxMr| ; (8.117) 


Once we know the coordinate dependence of p(r), we can ex- 
press the metric coefficient B(r) as a function of p. We note from 
(8.117) that 
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(2p(r) — (r — GyM))? = 1? — 2GyMr, 
or,  4p*(r) — 4p(r)(r — GyM) + (r — GyM)* = r? — 2Gy Mr, 
or,  4p* —4p(r —GyM)+(GyM)? =0, 
or, (29 + GyM)* — 4pr = 0, 
Si. se (1 ee aM", (8.118) 
which leads to (see (8.112)) 


B(r) = is = (14 St)" (8.119) 


Furthermore, using (8.118) we note that 


Afr) = 1- 28M, 2GuM 
r p (1+ Gui ) 
2 
p(1+ Suit (S") ) ~DGeMt 
pte) 
(= 
20 
(1+ - ) 


Therefore, the conformal line element can be written in the unique 
form 


2 
1 GnM 4 
M 
dr? = (eS) al dt? — (1+ 3: ) ds”. (8.121) 


CHAPTER 9 


Tests of general relativity 


There are several tests of general relativity that we can do with the 
Schwarzschild solution obtained in the last chapter. We have already 
talked about the gravitational red shift in section 6.3. That essen- 
tially tests the principle of equivalence. In this chapter we describe 
some other experiments which directly test general relativity. 


9.1 Radar echo experiment 


° © 
planet 


earth 


sun 


Figure 9.1: Radar echo experiment. 


Let us consider a small planet such as Mercury or Venus between the 
earth and the sun and assume that there is an observer on earth in the 
gravitational field of the sun as shown in Fig. 9.1. Let the coordinates 
of the observer and the planet at a given time be (r1,61,¢,) and 
(r2, 02,62) respectively (r; > r2) with the sun defining the origin 
of the coordinate system. Furthermore, let us assume that when 
they are aligned, the observer on earth sends a radar signal which 
is reflected by the planet and is received back on earth. We are 
interested in calculating the time interval between the transmission 
and the reception of the radar signal. In special relativity light signals 
travel with the speed of light (c = 1) and hence the time interval is 


265 
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the same as twice the radial separation between the two planets given 
by (see (8.104)) 


TL TL 1 
M\"2 
At = 2 far = 2 f ar (1 sen ) (9.1) 
r 
T2 T2 


Assuming the gravitational potential of the sun to be small we can 
approximate this as 


At, & 2 f ar (14 om) 
- 
r2 
TL 
= 2 ln —1r2)+GyMln 7 : (9.2) 
2 


On the other hand, looking at the Schwarzschild metric (8.50) 
we know that since for the photon dt = 0, we have (motion is along 
the radial direction so that d@ = 0 = d¢) 


M 2GyM\~" 
(:--=*) ae = (1 an ) dr?, 


dr | 2GyM 
oO, a= (1 iar ) : (9.3) 


This is the coordinate velocity of light. Therefore, the coordinate 
time interval taken by the signal for the same process is given by 


ry 


TL 
dr 2GyM 
r S 


T2 


= 2 ln — rz) + 2GyM In 7 (9.4) 
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The proper time interval recorded by the observer on earth is 
obtained from the coordinate time interval to be 


1 
AT = 960 (71) At 


1 
= (1) ae 


ry 


(1 a a) x2 G — ro) +2GyM In 4] 
TY r2 


iveece 
i fim ra) + 2Guarn SPT ra), 
r2 ry 


d 


(9.5) 


Here we have restricted only to the dominant terms. However, this 
already shows that there is a difference in the time interval predicted 
by special relativity and general relativity. In fact 


(9.6) 


At — At, © 2GyM In Pie ae : 


i) ry 


This time interval has been measured experimentally and the 
result agrees with the theoretical prediction up to about 20 percent. 
However, one can criticize such experiments on many grounds. First 
of all the planet involved is not a point object. The earth is not 
stationary, but is moving in its orbit. Both the planet as well as 
the earth produce gravitational fields which have not been taken 
into consideration in the calculation. In recent years, however, such 
questions have been taken into account and experiments yield a 5 
percent agreement with the theoretical result. These tests come from 
the tracking of the satellite systems where the satellites are relatively 
small and can be treated as massless objects. Thus the time delay 
experiment marks a good success of general relativity. 


9.2 Motion of a particle in a Schwarzschild background 


The Schwarzschild metric (8.50) is given by the line element 


= 
4,2 = (1 = os) dt?— ( E oo) dr2—r?(d62 + sin? 6d¢2), 
rf T° 


(9.7) 
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where Gy is Newton’s constant and M is the mass of the star pro- 
ducing the gravitational field. Let us now consider the motion of a 
particle of unit mass in the gravitational field of this star. Since the 
particle is massive, we can use its proper time to label the geodesics. 
We will discuss the motion of the photon separately in the next sec- 
tion. For the present problem, therefore, we can write a Lagrangian 


for the particle motion as (up to a factor 5) 


L = gyite” 


at 
7 (: 7 st 2. (1 - a 72 — 72(62 + sin? ad), 
y fe 


r 


(9.8) 


where a dot denotes differentiation with respect to the proper time 
7. Let us note that t and ¢ are cyclic variables and, consequently, 
we would have constants of motion. However, before we write down 
all four equations, let us look at the @ equation which leads to 


OL Ol. 
dr 96 O06 ” 
or, < (—2r76) + 2r? sin @cos@ ¢? = 0, 
or, —2 < (r26) +r? sin 20 6? = 0. (9.9) 


Therefore, we see that if the particle motion initially is in the equa- 
torial plane, namely, if initially we have 
6=x/2, 6=0, (9.10) 
then, 
d 9»; 
— (r°0) =0, 
dr ee) 
or, 7?6=constant =0, (9.11) 
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where the vanishing of the constant follows from the fact that its 
value is zero initially. Therefore, the angle @ for the particle does not 
change with time and the motion of the particle always lies in the 
equatorial plane. 

Consequently, to simplify the problem, we would consider the 
motion only in the equatorial plane (0 = 4). The Lagrangian (9.8), 
in this case, has the form 


1 
L= (1 = a PS (1 = ae pr — 72 g?, (9.12) 
r r 


The ¢ equation following from this Lagrangian leads to the relation 


4 ab ab _, 
dr a6 O06 ” 

or, < (—2r7¢) = 0, 

or, r’¢=constant = &. (9.13) 


Similarly, the t equation yields 


d OL OL 
dr Ot at 
or, a (2 (1 _ ae i = 0, 
dr ig 
or, (1 — a t=constant =k. (9.14) 


These relations lead to the two conserved quantities associated with 
the problem alluded to earlier. In particular, we note that the ¢ 
equation can be thought of as leading to the conservation of angular 
momentum. 

Furthermore, the r equation has the form 
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d /dL OL ‘ 
dr " Or” 
-1 

or, a a) ,— 2Gni4 Fa _ 2GnM pp 

dr ‘ Te 

-2 
© (1 - a Pant #2 + 2nd? =0, 
r r 
2GnM\~1 IGEN * OGaM 
or, -(1- aa ) po ) an i? 
r r r 


r2 


_ GM jy _ (1 : re GuM . 
r 


( GxM Ys ( 2GxM\" GuM .» 
or, 1, = —— r— | 1—- — Tr 
r 


i GyM 


3 ?—r¢? =0. (9.15) 


Although we can try to solve the radial equation (9.15), it is 
often more convenient to solve for r from the equation we obtain 
from the line element (9.7) (with 6 = 5), namely, 


-1 
1= (1 2 a —_ (1 = am Pp — 7g, (9.16) 
r 


a 


Dividing (9.16) throughout by é? we obtain 
| 2GuM\ ®  (,_2GuM\" fdr\* 9 _ 1 
r 2 r d¢ jee ge’ 


2 2 59 
or, (=) — (1 - a ae +r? (1 - “ot) 
dg r ¢ r 
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dr\? 5 IGE FT, 5 2GyM 
i: Gy) = eee): 
(9.17) 


where we have used (9.13) and (9.14). If we now define a new variable 


1 dr 1 du 
aa eee amie ae) 


then equation (9.17) simplifies to 


1 /du\? i 1 2 
laa = (1 — 2GyMu) + Ba((k —1)+2GyMu), 


dé 
1 


2 


Upon integration, this leads to 


2 
or, (3) — —uv(1 _ 2GyMu) + a((e _ 1) + 2GyMu) 


((k? — 1) + 2GyMu) — u?(1—2GyMu). (9.19) 


[eo= {[ ——__“ _____., 
[((k2 — 1) + 2Gy Mu) — @u2(1 — 2GyMu)]? 
o, = do4 f ——__#f_____, 
[((k2 — 1) + 2GyMu) — @u2(1 — 2GyMu)]? 
(9.20) 


As a result, we see that the value of ¢ can be determined as a 
function of r or conversely the value of r as a function of ¢. This is 
an exact result. However, it is not easy to see from this relation the 
elliptic form of the orbital motion of the particle (planet) that we 
expect. To see this, we differentiate (9.19) with respect to ¢@ which 
yields 


du d?u 2GyM du du 9 du 
OE ne ices SNES eo ee My — 
ge ae a a 


du (d?u GyM 2 
a & — “tu 3GyMu ) 2G, (9.21) 


or, 
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One of the solutions of (9.21) is 


or, 1() = constant, (9.22) 


which corresponds to circular motion. However, the more interesting 
solution of (9.21) comes from 


d?u = GyM 
d¢2 


—u+3GyMu’. (9.23) 


To understand the meaning of this equation, let us look at the 
Newtonian problem of a point particle of unit mass moving in the 
equatorial plane in the gravitational field of a star for which the 
Lagrangian is given by (here a dot denotes a derivative with respect 
to t) 


1 1 5; GyM 
Lee poe (9.24) 
2 2 
The ¢ equation following from this gives (as also in (9.13)) 
r?¢ = constant = &. (9.25) 
The r equation, on the other hand, has the form 
. M 
ij—rd? + BLE = 0, 
r 
. € GyM 
or, Fa + 3 ==): (9.26) 
As before, let us change variable as 
1 
as 9.27 
wat, (9.27) 


so that we have 
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dr dédr_ 4 (- a)--0$ 


" “dt dt dg wd) dé 
Z dr dé d (fdr 5 id du 
f=] ae = ee eS | 
dt? dt d@ \dt de de 
d7u 
= 22 


Thus the classical Kepler problem (9.26) leads to 


2 
a ais ae = Pye a GuMu2 = 0, 


d¢? 
2 
M 
or, — = oun —U. (9.29) 


This equation is known to lead to elliptic orbits and is similar to 
(9.23) which we have derived from the Schwarzschild metric. Com- 
paring the two we conclude that the term 3Gy Mu? in (9.23) must 
represent the relativistic correction to the particle motion. There are 
now two special cases which we consider below. 


9.2.1 Vertical free fall. Let us consider the case where a massive par- 
ticle (m = 1) falls radially in the gravitational field of a massive star. 
In this case 


=0, (9.30) 


which leads to (see (9.13)) 


£=0, (9.31) 


and consequently we cannot divide by ¢? as we had done earlier 
in (9.17). In this case the equation from the line element in (9.16) 


becomes (¢ = 0) 
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-1 
(1 a a) i (1 = so f=, 


r r 
k2 pe 
a 2GnM 2GnM ? 
Es) 
2GnM 
or, pe — ke + (1 —s ) = 0) (9.32) 
r 


This clarifies the meaning of the constant k. For if the particle falls 
from rest at some value of r = rp, namely, if 


Ae s=0 (9.33) 


then (9.32) implies 


ee (1 _ a ar (9.34) 


TO 


where the positive sign of the square root is taken to signify that 
coordinate time increases with 7 (see (9.14)). Equation (9.34) shows 
that the constant k is not a universal constant, rather its value de- 
pends on the particular geodesic of the particle. For example, if the 
geodesic corresponds to the particle being at rest at a coordinate 
infinitely far away from the star then 


k=1. (9.35) 


Putting in this form of k into (9.32) we have 


1. 
or, 5 fr =GyM (+ 7 ~) (9.36) 


Differentiating this equation with respect to 7, we obtain 
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ay GyM . 
ro= —— fF, 
r 
M 
or, # = oon ; (9.37) 
r 


Both these equations in (9.36) and (9.37) are similar to the classical 
Newtonian equation. The first one is analogous to the statement that 
a particle falling in a gravitational field gains an amount of kinetic 
energy equal to the change in its potential energy. The second form 
merely represents Newton’s equation for the radial fall of a particle 
in the gravitational field of a massive object. However, we must 
remember the differences also. In particular, r does not measure the 
radial distance as we have seen and dots here refer to differentiation 
with respect to proper time. 

Equation (9.36) also allows us to calculate the proper time ex- 
perienced by a particle falling from rest at r = ro. Thus if we assume 
r=ro at T =0, then 


r! TO 


4 
I 


NIH 


ia) 
7 1 ’ ror’ 
- groan |* (GER) ss 


Here the limits of integration are chosen such that the proper time 
is positive (that is, the radial velocity actually decreases vectori- 
ally). The limit of integration can actually be extended up to the 
Schwarzschild radius r = rs = 2GyM unless the physical radius 
of the star is larger than the Schwarzchild radius. We also note 
that the proper time experienced by the particle in going up to the 
Schwarzchild radius is finite since there is no singularity in the inte- 
grand. 

On the other hand, if we look at the coordinate time we note 
from (9.14) that 


ee ey (9.39) 
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Integrating this we have 


mb 
II 
Qa. 
4 
SoS, 
—_ 
| 
Q 
Siz 
s 
wN_” 
NIR 


1 
i dr’ (1 ~~ | ; ror’ 3 
/ /2GyM (1 7 26x) ( ) ; (9.40) 
is 

This clearly shows that the coordinate time diverges as we approach 
the Schwarzschild radius. In fact, the integrand is singular at the 
Schwarzschild radius. 

Let us next calculate the coordinate speed of the particle from 


the definition in (9.36) 


dr 
dt 


r 


aye acu oe (9.41) 


We can obtain the coordinate speed from this as 


dr dr dr Reais Ye: 
un = => aan! r 
1 — 2GnM 
a). 
k 


1 


(1 _ 8) 2 


TO 


= oS) lacar om) 
or 


where we have used (9.39). 

This shows that although 7 remains finite as we approach the 
Schwarzschild radius, the coordinate speed goes to zero in that limit. 
As a result, the particle takes infinitely long to reach the Schwarzschild 
radius. Furthermore, setting 


du 
an 0, (9.43) 
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from (9.42) we obtain 


2GyM TO oer fe 3 
r2 ror 


Li 
+ (1-260 ; (2) ‘(-3)|=0 
r 2 ror r 
1 
or, 2GyM — = (1 — ) 


1 2GyM 
or, 2GyM(ro 1) — 5 (:- : ) ror =0 


or, r= =. (9.44) 


This analysis shows that if the particle falls from rest at infinity 
(r9 — co), its coordinate velocity increases until 


r=6GxM, (9.45) 


after which it decreases and goes to zero at the Schwarzschild radius. 
Let us compare this behavior with the classical Newtonian result 
which gives 


u(r) = ea (9.46) 


r 


and hence leads to the conclusion that the speed of the particle keeps 
increasing and diverges at the origin. 


9.2.2 Circular orbit. For a circular orbit we have 7 = 0 = 7, and, 
consequently, the r equation (9.15) takes the form 
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Sa re 

z 

# _ (ag)* _ GxM 
ee eee cc eee 


o fmf (ea) e= (Gy) fae oar 


Therefore, the period of the orbit is given by 


rs \2 f 


This is exactly like Kepler’s law which says that the square of the 
period of an orbit is proportional to the cube of the radius. However, 
we must remember that here r does not denote the true radius of the 
orbit and At measures the coordinate orbit period. 

If an observer is at rest at a distance rg, then the proper time 
of the orbit that he would measure is (see (9.7)) 


1 
Ae = Aon) A= (1 : ae At. (9.49) 
0 

We note here that as r9 > co, At = Ar. Hence by measuring the 
period of the orbit we can measure the radius of the orbit. 

At this point, we can inquire about the period that an observer 
moving with the particle or the planet would measure. We note from 
(9.39) that 


aN pe (1 - a k-1At, (9.50) 
r 


so that the value of the period depends on the constant k. To deter- 
mine the constant & we note that in this case rf = 0 and hence from 
(9.16) we obtain 


1= (1 = sett =e O (9.51) 


r 
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On the other hand, we know from (9.47) that for circular orbits 


GyM . ‘ 
a ae (9.52) 


r 


Using this in (9.51) we obtain 


2 

2G M 
oS) 
3GnM \’ 
=) 


so that (9.50) leads to 


AT = (:- =") k-tAt 


or, k= (9.53) 


ul 
or, Ar = (i-=")" At. (9.54) 


Note that since k? is positive, bound orbits are possible only if r > 
3GyM. Furthermore, when r = 3GM, Ar = 0. This shows that 
photons can be bound in circular orbits only if r = 3GyM (since the 
proper time vanishes for photons). We will come back to this point 
in the next section. 

As a simple application of these results, let us consider two 
observers on a planet which is moving in a circular orbit of radius r 
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around a star. One of the observers takes off on his space craft and 
maintains his position at a fixed point near the orbit of the planet 
and then joins the planet at the end of a period. Thus the time of 
absence as measured by the observer on the space craft is given by 
(see (9.49) ) 


1 
M\2 
Ne (1 z a At, (9.55) 


r 


whereas the observer on the planet would measure the time of ab- 
sence as (see (9.54)) 


1 
Ato = (1 = a * At, (9.56) 


r 


which leads to 


i 
Asp 1 _ 2GnM 2 
r 


This shows that the observer moving in the space craft would find 
himself older (if they are twins) on return. This should be contrasted 
with the result from special relativity where the twin on the space 
craft would believe that he would be younger on return. 


9.3 Motion of light rays in a Schwarzschild background 


To consider the motion of light rays in a Schwarzschild background, 
we have to make two assumptions. First of all we assume that the 
light rays behave like any other particle and hence their motion in a 
gravitational field is given by the geodesic equation (see (5.83) ) 


d2x! a aa da* 
dr? ma, dr a. (9.58) 


Furthermore, we assume that the proper time interval for photons 
generalizes from Minkowski space-time to be 
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dr? = 0. (9.59) 


Clearly, therefore, we cannot use 7 to label the geodesics. Instead we 
would use a different affine parameter, say A to label the geodesics. 
Thus the geodesic equations for the photon are similar to other par- 
ticles and in particular if we specialize to 6 = 3, the equations are 


given by (see (9.13), (9.14) and (9.15)) 


r?@ = ¢ = constant, 


IGuM\ . 
(1 mapas ) t = k = constant, 

r 

IOI 2GyM \~°Gy M72 

P= r= (t= : 

r r r 

Ga 

oer — rd? =0, (9.60) 


where we have to remember that the dots now refer to differentiation 
with respect to the affine parameter A. The only difference from the 
particle equation, in this case, lies in the constraint relation 


dr? = 0, (9.61) 


which translates to 


dx" dx’ _ 
oe ax a 8 
—1 
or, (1 - ae —_ (1 = ae fp — 779? =0, 
r r 


2 -1 2 
or, (1 = an ue = (1 = N) (<<) —r*=0, 
r 2 r dd 


IGM k? r4 IGyM\—l/ dr\? 
or, 1 — —— 2 RR Ey, ee 1- = 
r (1 _ ae] £L r d@ 
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2 24 
or, (=) ES (1 = aut) re “A =i; (9.62) 


As before, we define u = 4 so that equation (9.62) becomes 


2 
(<3) + (1-2GyMu)w? — — =0. (9.63) 


Differentiating this equation with respect to @ we obtain 


du d?u du 9 du 
do ae A aaa ade de = 0, 
du (d?u 9 
or, do (3 i roe 3Gy Mu ) = 0. (9.64) 


The radial geodesic equations, therefore, lead to 


du _ 0 d?u 
do’ d¢2 
The first equation implies a circular orbit where the radius vector 


does not change. Setting 7 = # = 0 in the radial equation in (9.60) 
we see that for this to be true 


+u—3GyMu* = 0. (9.65) 


GuM jp _ re. (9.66) 


r2 


On the other hand, the line element equation (9.62) for this orbit 
gives 


(1 - a =r’. (9.67) 


Comparing the two equations we determine 


or, r=38GyM. (9.68) 
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This shows that circular orbits for photons are possible only if the 
radius of the orbit satisfies the above condition. But we note here 
that this value of the orbit radius for most objects (stars) lies inside 
the physical radius of the object (see discussion after (8.52)) and 
hence such an orbit is not physically meaningful. Thus the radial 
equation yielding the true orbit that we can observe for photons is 
given by the second equation in (9.65) 


OO a apne RS (9.69) 
—  +u- (ae Uf ; 

d¢? 
The last term, of course, is a relativistic correction (as we have seen 
earlier at the end of section 9.1) and is small compared to the second 


term as we can see from the fact that 


3Gy Mu? 3. 2GyM 3 
Se SoS ee, (9.70) 
U 2 Tr 2T 
even if we consider the light ray near the physical surface of the 
planet or the star. Thus neglecting the last term we can ask what is 


the trajectory of the light ray to the lowest order satisfying 


d2 
ae +u=0. (9.71) 


The solution of this (harmonic) equation is given by (with a partic- 
ular choice of the phase) 


u(d) = Asin ¢, 


1 
or, —=Asind, 
r 
: 1 
or, rsing= ria constant. (9.72) 


We recognize that rsing represents the value of the y coordinate 
(of the photon). Thus the solution to the lowest order represents 
a straight line parallel to the x-axis and to lowest order light rays 
travel in a straight line. We see here that < measures the distance 
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of closest approach of the photon to the star. Denoting 5 by ro, the 
lowest order solution of (9.69) can be written as 


“= Z sin ¢. (9.73) 


TO 


Figure 9.2: Photon trajectory near a star at the lowest order. 


Equation (9.73) shows that light rays are unaffected by the grav- 
itational field in the lowest order as shown in Fig. 9.2. To find the 
effects of the gravitational field we have to solve equation (9.69) 


d?u 7 2 2 

Te +u = 3GyMu* = eu’, (9.74) 
to the next order. Here € = 3G, M is a small parameter which can 
be used as a parameter of perturbation in determining the solution. 
We expect the true solution to differ only slightly from the lowest 
order solution in (9.73). Thus perturbatively we can write (to the 
next order) 


1 
us —sing+ eur, (9.75) 
r0 


and putting this into the equation and keeping terms only up to 
linear order in € we have 
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1 d? i 1 : 
——singd+e atl + (— sino + eu) =e (= sin + eu) ; 
ro T0 


de 
SE yy hea pe oe aa) 
or, dee U1 =e€E re S =e€E or 2 CO; 5 
fe By coed (1 — cos 20) (9.76) 
or, dee 1= Ore . . 


This suggests that we can choose the solution to be of the form 


u, = a+ bcos 2¢, (9.77) 


which, when substituted into (9.76), leads to 


je 
—4b cos 2¢ + a + bcos 2¢ = =z (1 — cos 2¢), 
"9 


Ae 
or, a-— 3bcos2¢= 52 5 (1 — cos 2¢). (9.78) 
0 
This determines 


1 1 
ce, t2—5, (9.79) 
2r6 6r5 
so that we can write the solution (9.75) with the leading order cor- 
rection as 


i 1 1 
u(d) & ere (s+ 30082) ; 
1 1 3GyM 1 
or, = 7 ane + oe (1 + 3 cos 26) , (9.80) 


At infinite separations if we assume the light ray to come in at 
@ ~ 0 then it will go back to infinity at @ ~ a. If the relativistic 
corrections are not present, then of course, the angle of incidence is 
exactly @ = 0 and the angle of exit is 6 = 7. Let us assume that 
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61 Os. 


Figure 9.3: Bending of a light ray near a star at the leading order. 


because of the gravitational effects, the light ray comes in at 6 = —6; 
and goes away at @ = 7+ 62 as shown in Fig. 9.3, where 6; and 62 
are small angles. Clearly this implies that at incidence we have 


1 3GyM 1 
— sin(— 1+- —26 =0 
sin(—d,) + ara ( + 3 cos( ») , 
“) 3GyM 1 
o, —-—+— (1+5) = 0, 
TO 2r6 3 
2GynM 
* ooo (9.81) 
0 


Similarly, at the other end we have 


1 3G M 1 
a sin(a + 62) + Dr? (1 + 3 cos(27m + 22) =O, 
in 6: 3GyM 1 
or, ee + ~ (1 + = cos 252) = 0, 
To 2ro 3 
3Gy M 1 
or, _o2 + ~ (1 + 5) = 0, 
TO 2r6 3 
2Gy M 
or, 62 =———. (9.82) 
ro 


This shows that the bending in the path of the light ray (from the 
leading order trajectory) as it comes in is the same as that when it 
goes out. This is, of course, expected from the spherical symmetry 
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of the problem. The total amount of deflection in the path of the 
light ray, in the presence of the star, is given by 


4GyM 
6, +62 = (9.83) 


and is larger when the impact parameter is smaller. The bending 
of light, as it is commonly called, can be experimentally tested and 
is usually done by studying the light rays grazing the surface of the 
sun. In this case the theoretical prediction for the deflection is 1.75”. 
Earlier measurements of this phenomenon involved photographing 
the stellar images during solar eclipse and then comparing them with 
those taken six months later when the sun is not in the field of view. 
This shows a deflection of 1.5” ~ 3.0”. We must remember that 
this is a very difficult experiment and various factors change during 
the six month interval. Consequently measurements cannot give very 
accurate results. In recent years one uses radio telescopes to measure 
the radio signals from point like sources such as quasars as they pass 
near the sun. Such measurements give a value of 1.57” — 1.82” for 
the deflection of the signal with an error of .2”. This is a fairly good 
test of the theoretical prediction. However, errors need to be reduced 
still further. 


9.4 Perihelion advance of Mercury 


We have seen in (9.29) that the general equation for planetary mo- 
tion in classical physics is given by (recall that the planet has been 
assumed to be of unit mass) 


du _ GM 
dd? 


Elie (9.84) 


Therefore, the general solution can be written as 


u(o) = : = — (1 + ecos(¢ + 4)), (9.85) 


where e and 6 are constants. By a suitable choice of the coordinate 
axis, we can choose the phase 6 to be zero in which case the solution 
takes the form 
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GyM 
uo) = re (1 + ecos¢). (9.86) 
To determine the constant e we note that the total energy of 
the classical motion is given by (see (9.24)) 


i. CSM 
| ea ee 
See ; 


1 (“ a) 12 GyM 


2 \dt dd 2 r2 r 
1 du\? 1 
_ 4+ 2 (au +22 
= 5 (3) +5 Cu? —GyMu, (9.87) 


where u = 4 and we have used (9.25). We can rearrange this to write 


du\* 2E  2GyM , 
(=) =p ge. te Mes (9.88) 


Substituting the actual form of the solution from (9.86) we obtain 


2 2 
(= esin = +2 (S*) (1 + ecos @) 


2 
-(S") (1 + ecos ¢)?, 
GyM\* . 2E  (GyM\? 
Re Nee heart eae 
2b 
Or, e2 —h + GM (9.89) 


It is worth recalling here that for bound orbits, the total energy 


E <0 so that the eccentricity of the orbit e < 1. (e = 0 leads to 


_1_ ££ 4 ; , 
CS = Gut corresponding to circular motion.) 
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With this value of the constant, the radius takes values accord- 
ing to 


GyM 
u(d) === <= (1 +¢c08 4). (9.90) 


The perihelion (closest point to the star) and the aphelion (farthest 
point from the star) on the orbit are determined from the condition 
that at such points 


0; , —=0, 9.91 
ap oO, 5 F (9.91) 
which implies ¢ = 0,7. This fixes the perihelion and the aphelion to 
be at 6 = 0 and ¢ = 7 respectively. The distances of these points 
from the star are given by 


2 1 
— ——_- — 7 
ae CaM Tae 
2 1 
= ah. 9.92 
Tes Culf-lee 7 we) 


GuM 
Uap = ZB (l-—e) =u, 
GuM 
Upy = B (1+e) =u. (9.93) 


Parenthetically, we note here that u,,u2 are simply the two roots of 
the quadratic equation (9.88) with (9.91) and (9.89). 

Let us now include the relativistic effects. We know that in the 
Schwarzschild background (in the relativistic case) the motion of the 
particle (planet) is described by (see (9.19)) 


pate = + — 2GyM 
(<3) ap Cee Gy Mu 


2E  2GyM 
= ew IG. (9.94) 
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Here we have made the identification 2K = k? — 1. Furthermore, 
we recognize that the last term in (9.94) represents the relativistic 
correction (compare with (9.88)) and noting that 2Gy M is a small 
quantity we can denote it by € and write 


a) ~@ a u? + eu. (9.95) 


(4) _2E 2GyM 
We can now ask how the perihelion and the aphelion ((9.92) or (9.93)) 
would modify when we include this relativistic correction. Using 
(9.95) these positions are determined from (see (9.91)) 


du _ 4 
d@é : 
2GyM 2E 
or, eu — uw + —— ut = 0, 
1 2GyM 2E 
3 2 N = 


This is a cubic equation with three solutions u1, u2, ug such that 


1 
Uy + Ug + U3 = e (9.97) 


which is a large quantity since € is small. On the other hand, we 
have seen that the aphelion and the perihelion given respectively by 
u; and ug are small (see (9.93)) compared to 4+ and hence it is the 
third solution which must be large and unphysical and we can assume 


the motion to be bounded by 


Ur SU < Ug. (9.98) 


As a result, we can write (9.95) as 
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du\* 3_ Ww 2GyMu | 28 
d@¢ = le € € el? 


= e(u—Uuz)(u2 — u)(ug — u) 


Ls, Maia Nts w (2 seca w) 


€ 


= (u—u1)(ug—u)(1— e(u + ui + U2); 


oy ag = [(u — w1)(u2 — u)]? (1 — e(u + a +.u2))?, 
ee dg _ (1= e(u + + ta) 
du Dy 


[(u — v1 )(u2 — u)] 


1+ 5 e(u+u + U2) 


~ (9.99) 
[(u — u1)(u2 ~ u)]? 
Let us define 
1 1 
aed) (ui + ua), B= 3 (uz — wu), (9.100) 
so that 
uy =a 8B, ug=at+8, (9.101) 
and it follows that 
(u—ui)(uz—u) = (u—(a—f))(a+ 8 —u) 
= (6? =(@=e)]. (9.102) 
Thus, we can write (9.99) in the form 
do _ 1+ geut+ea _zeu-a)tlt+5ew (9.103) 


du (P= (ua)? [B= (ua)? 
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Integrating this, we can obtain the change in the angle between the 
perihelion and the aphelion as 


5A) = [ow [5 e(u—a) +143 ea] 


| 
| 
I 
On 
— 
bo 
| 
—— 
S 
| 
Q 
Ww 
ae 
Nie 
— 
fo 
i 
+ 
I 
an 
Q 
nN 
a 
=) 
AF 
S 
| 
Q 
a 


_ (1 +5 ca) (sin~!(1) — sin“!(-1)) 


_ (1 +5 ca) (5+) =n (143 ca). (9.104) 


Doubling this gives the angle between successive perihelia and shows 
that it does not return to its original position, rather it advances by 
an angle 


1 
Ad = 2\6¢| — 20 = 37€a = 3m X 2GYM x 5 (u1 + uz) 
1 1 
= 3nGyM (= i =) (9.105) 
TL r2 


Clearly, the advance in the perihelion is a relativistic effect and 
is larger for planets closer to the sun. Therefore, Mercury is the 
natural candidate to study this phenomenon. Even for Mercury this 
angle is very small, 


(Ad) /revolution ~ 0.1038”. (9.106) 
However, this effect is cumulative and per century the perihelion of 


Mercury advances by 


(Ad) /century ~ 43”, (9.107) 


where we have used the fact that Mercury’s revolution period is 88 
days so that in a century there are about 415 revolutions. This 
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advance of the perihelion (9.107) is appreciable and can be observed. 
In fact, the astronomer Leverrier had already discovered in 1845 
that Mercury exhibited an anomalous precession of the perihelion 
of about 43” per century that could not be accounted for by known 
perturbations at the time. Thus the precession of the perihelion of 
Mercury constituted an early verification of general relativity. There 
are numerous other tests that show that general relativity conforms 
to astronomical observations quite accurately. 
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CHAPTER 10 


Black holes 


We have already seen that the Schwarzschild metric given by 


2GyM St 
dr? = ( = a dt? — (1 = oe dr? —r?(d6? + sin” 6d’), 
(10.1) 


has manifest singularities at r = 2G M and at r = 0. The singularity 
at the origin, if the particle is a point particle, is, of course, a real 
singularity which is present even in the classical case. The singularity 
at the Schwarzschild radius r = 2GyM, on the other hand, has a 
different character. Note that although 


gu(r = 2GyM) = 0, (10.2) 


the determinant of the metric is well behaved and that if we calculate 
the components of the curvature tensor, they are also well behaved 
at that point. Thus, we conclude that the point r = 2Gy.M cannot 
represent a physical singularity of the space-time. However, we have 
also indicated earlier that for r << 2GyM (see (8.61)) 


tt < 0, Orr > 0, (10.3) 


so that the time and the radial components of the metric (and, there- 
fore, the coordinates) seem to exchange their roles and hence there 
must be some interesting phenomena in that region. This is what we 
will investigate in this chapter. 


295 
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10.1 Singularities of the metric 


Whenever the metric components are badly behaved at a certain 
point in a coordinate system, one of two things may happen: 


i) the space-time geometry is, in fact, singular at that point, 


ii) the space-time geometry is nonsingular but the coordinates fail to 
cover a region of space-time containing that point properly. 


Normally the first possibility can be demonstrated by calculating the 
scalar curvature and showing that it diverges at some point of space- 
time. Furthermore, this singularity must occur at a finite value of 
the affine parameter labelling the geodesic. That is, the singularity 
must not lie at infinity. However, even this is not fool proof because 
there are singularities of space-time where the scalar curvature is not 
necessarily singular. The second possibility is usually demonstrated 
by finding a coordinate system where the metric components are 
well behaved. Thus the study of singularities of a manifold is an 
extremely difficult one. 

We would begin by illustrating with simple examples how a 
wrong choice of coordinates can make the metric components look ill 
behaved. Let us consider the two dimensional line element given by 


dee a dP aan (10.4) 


defined over —oo < x < co and 0 <t< ow. The metric in this case 
appears to have a singularity at t = 0. However, the true nature 
of the space-time geometry can be seen by making the coordinate 
transformation 


= a (10.5) 


so that the line element becomes 


dr? = dt? = dz’, (10.6) 


with —oo < x < ow andt > 0. But this is nothing other than 
the upper half plane of the two dimensional Minkowski space-time. 
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Furthermore, the singularity has now disappeared and hence we can 
extend the form of the line element to the entire Minkowski space. 

The second example is again of a two dimensional space known 
as the Rindler space and is closer in analogy to the Schwarzschild 
metric than the previous example. The line element, in this case, is 
given by 


dr? = x7dt? — da?, (10.7) 


with —oo < t < co and 0 < x < ow. It is obvious from (10.7) that the 
metric is singular at « = 0. That is, at this point the determinant of 
the metric vanishes and consequently the contravariant metric com- 
ponents are undefined at such points. On the other hand, calculation 
of the scalar curvature shows no singularity as x + 0 suggesting that 
the singularity may be a coordinate singularity. In fact, the curva- 
ture for the Rindler space vanishes identically suggesting that this 
space may be a part of the Minkowski space. We ignore this point 
for the present and continue with our analysis of the singularity. 

To find a coordinate system that would get rid of the singularity, 
let us look at the null geodesics in this space. The null condition leads 
to (dots denote derivatives with respect to the affine parameter A) 


re —¢* = 0; (10.8) 


which yields 


or, ¢t=+Inz+ constant. (10.9) 


Thus the outgoing and the incoming null geodesics are naturally 
given in terms of the null coordinates 


u=t—Ing, v=t+Ing. (10.10) 


Namely, an outgoing null geodesic is given by 
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u = constant, (10.11) 


while an incoming null geodesic is given by 


v = constant. (10.12) 


In terms of these coordinates, then, we can write 


1 
i= gv +u), o= exe) (10.13) 


so that the line element (10.7) takes the form 


dr? =e’ dudv. (10.14) 


We are still not in a position to analyze the singularity at 7 = 0 
since the coordinate range —co < u < oo and —oo < uv < ow still 
corresponds to x > 0 of the Rindler space. However, let us recall 
that for this space, t is a cyclic variable in the Lagrangian density, 
i.e., if we take (see, for example, (9.8)) 


Laer a, (10.15) 


where dots denote derivatives with respect to the affine parameter 
X, it follows from the t equation that 


a (al) _, 
di \dat) 


OL : 
or, — = 277i = constant, 


ot 


dt 
or, 2 ax E = constant. (10.16) 


Furthermore, we note from (10.13) that 


uty = 2t, (10.17) 
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so that for a constant u (outgoing geodesic, see (10.11)), 
dv = 2dt. (10.18) 


As aresult, for the outgoing geodesic for which u = constant, (10.16) 
can be integrated to give 


= a Deas (v—u) 
At: = pusz face =5q [eve 


(v-u) 


eu 
= constant + = constant + (=) e”. (10.19) 


Similarly, for an incoming geodesic we can determine 


at, _i 2_ 1 (v—u) 

An = pusz fawraz [ave 
= constant — ao ent (10.20) 
= consta op} e : 


This shows that the affine parameters lead to incomplete null 
geodesics. This also suggests that if we now define new variables 


U=-e™, Vee (10.21) 


then, the line element (10.14) can be written as 


dr? = dUdV. (10.22) 
The ranges of the new variables are given by U < 0 and V > 0, but 
since there is no singularity in the line element any more, we can 
extend it to cover the entire space-time 

—o0 <U < ow, —oo < V <0. (10.23) 


The final coordinate transformation 
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UV V-U 


T=, X=, (10.24) 
renders the line element (10.22) into the form 
dr? =dUdV = dT? — dX’, (10.25) 


which we recognize to be the two dimensional Minkowski line ele- 
ment. 


t=0,t— c 


x = const 
t = const 


z=0,t—- —oo 


Figure 10.1: Rindler space-time. 


The original coordinates (t,x) can be expressed in terms of the 
Minkowski coordinates (T,X) as (see (10.13), (10.21) and (10.24)) 


i) ed 


2 = e2 4) = (-UV)2 =(-(T— X)(T + X)) 


= (7 = 77)2, 
1 1 mae 
t 5 (tu) 5 (nV n(—U)) ae 


= tanh! (x) ' (10.26) 
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We see from Fig. 10.1 that the original Rindler space corresponds 
to the wedge X > |T| of the two dimensional Minkowski space-time. 
The original singularity has now disappeared and we can extend the 
metric to the entire Minkowski space-time. 


10.2 Singularities of the Schwarzschild metric 


The analysis for the Schwarzschild metric can be carried out in an 
analogous manner to that of the Rindler space. The main difference 
lies in the fact that the Schwarzschild solution is four dimensional in 
contrast to the two dimensional Rindler problem. However, we can 
effectively reduce the Schwarzschild problem to a two dimensional 
one by simply restricting the motion to 6 = 4 (equatorial plane) 
with a fixed azimuthal angle (¢ = 0). In this case, the line element 


takes the form 


-1 
dr? = (1 = a ae = (1 — at dr?, (10.27) 


r 


which has a two dimensional character and the null condition in the 
present case is given by 


-1 
(1 = a a (1 = at p30; 


r r 
2 2 
we (8) = (ee) = (i 
? dr 1 i. 26nM eM 7 1 : 
dt ee 1 
Or, Ta 5) 
dr IM —1 


x 
or, i [ + 2GyM In lyon — 1)] + constant 


= +7 + constant, (10.28) 


where 7 defined as 


F=r+2GyMIn ( = 1) . (10.29) 


r 
2GyM 
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is known as the Wheeler-Regge coordinate. Therefore, the null co- 
ordinates in the Schwarzschild space are given by 


u=t-F v=t+h, (10.30) 


so that the outgoing and the incoming null geodesics can be written 
respectively as 


u = constant, v = constant. (10.31) 


We note that the relations (10.30) can be inverted to give 


1 . dL 
i= get, = 5 (v— 4). (10.32) 


Furthermore, from (10.29) we obtain 
OF 1 DG. 
oS =14+——_ -(1- =" ) ; (10.33) 
Or 3GnM = a r 


and as a result, we can write the Schwarzschild line element (10.27) 
in terms of u and v coordinates as 


2GyM 
dr? = (1 = os ) (dt? — dF”) = (1 = uk) dudv. (10.34) 


Here we have used (10.32) and we are supposed to understand that 
r is expressed in terms of u and v through the relation 


wu r 1 
part 2Guatin (7-1) = (v — wu), 


or, (soo = i) e2GnM = clint eu) (10.35) 
N 


which leads to 


10.2 SINGULARITIES OF THE SCHWARZSCHILD METRIC 303 


2G M “FNM ay 
(COM pee eee) (10.36) 
r 


We can, therefore, write the line element (10.34) as 


- 
Ce. 2Gy M (v—u) 


dr* = 2GyM ——— e?4n™ dudv. (10.37) 
r 


This form of the parameterization is known as the Eddington- 
Finkelstein parameterization (coordinates). Furthermore, analogous 
to the earlier example of the Rindler space, we can now define the 
coordinates (see (10.21)) 


U=-e NY, VV =e%nM, (10.38) 


so that the line element (10.37) takes the form 


eee 
e 2GNM 


dr? = 32(GyM)? = dd. (10.39) 


The Schwarzschild singularity at r = 2G, M has now disappeared 
from the line element with only a singularity left at r = 0. We know 
that this is a physical singularity and so we can now extend this form 
of the line element to all space with r > 0. The final identification 
with the Kruskal coordinates is made by defining (see (10.24)) 


ais V-U 


(10.40) 


so that the complete (four dimensional) Schwarzschild element is 
given by 


= Tr 
e 2GnM 


dr? = 32(G,.M)? (dT? — dR?) — r?(d6? + sin? 6d¢?). 


(10.41) 
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Note that the original coordinates can be expressed in terms of T 
and R as (see (10.32), (10.37), (10.38) and (10.40)) 


r ee eee 
=. | 2GQNM — _ — Rp2_72 
iy Je N UV = R?-T?, 


t 1 1 
4G,M = GM” a u) _ 3 (inv = In(—U)) 
= 5(n(R +7) -In(R=7)) = tam (F). (10-42) 


Thus, we can draw a space-time diagram similar to the Rindler 
space as shown in Fig. 10.2. 


r=0 


r = 2GnM,t — co 
II 


r = const 


Il I 


r = 2GNM,t — —oo 


r=0 


Figure 10.2: Schwarzschild space-time. 


We see from Fig. 10.2 that the region of space outside the Schwarzschild 
radius r = 2G, M is given by the wedge 


R>|T\. (10.43) 
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The original Schwarzschild singularity at r = 2G'yM corresponds to 
the null geodesics 


R=(TI. (10.44) 


Clearly the apparent singularity was due to a wrong choice of coordi- 
nates. The space now divides into four regions. Region I is, of course, 
the physical space outside of the Schwarzschild radius. Furthermore, 
r = 0 corresponds to the hyperbola 


yf cera beac eeae (10.45) 


It is denoted by the two shaded regions shown in Fig. 10.2. Note 
that in the two regions II and IV we have 


sce rea 0 (10.46) 


so that they are space-like in nature (in terms of the original t,r 
coordinates) whereas region III is time-like and hence represents an- 
other asymptotically flat region. If a light signal falls radially from 
region I, then within a finite proper time it would enter region II. 
However, once it enters region II it will be trapped there forever. 
That is, since it is a space-like region it will not be able to escape. 
For an observer far away from the Schwarzschild radius, however, it 
would appear as if the light signal would take forever to reach the 
Schwarzschild radius (see discussion around (9.42)). The light ray 
would be red shifted so much that to an observer far away its in- 
tensity would drop to zero. Thus it is clear that an observer cannot 
detect anything that happens inside the radius r = 2G, M. For this 
reason, the Schwarzschild radius is also known as the event horizon. 

Note also from the above discussion that even though there are 
two asymptotically flat regions, observers in these two regions cannot 
communicate with one another since light signals sent by them would 
eventually be lost in the intermediate regions. Region IT is known as 
a black hole, since according to classical ideas, it only absorbs and 
never emits any radiation. Region IV, on the other hand, is a time 
reversed case of region IT and is known as a white hole. In this region, 
matter and radiation cannot be present. They would eventually be 
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pushed out. Note also that the physical singularity at r = 0 occurs 
inside the regions II and IV and hence can never be detected by an 
observer outside the Schwarzschild radius. 


10.3 Black holes 


We may ask at this point whether the study of such regions is purely 
academic or whether there exist observational data to corroborate 
these. At the present time at least two black holes have been ob- 
served. The point is that although we cannot detect a black hole 
directly, its gravitational effect on other objects can be measured. 
There are several mechanisms to produce a black hole and one of the 
most popular ones is of the view that black holes are produced due 
to the gravitational collapse of a star. 


r=2GyM fod r=2GnNM 


Figure 10.3: Gravitational collapse of a star. 


For simplicity let us consider a spherically symmetric star un- 
dergoing spherical collapse because we can use the Schwarzschild 
solution which we have been studying so far. In this case the star 
would collapse under its own gravitational attraction as shown in 
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Fig. 10.3 and if the collapse is complete, then it would happen at 
the singularity r = 0. However, for gravitational purposes we would 
still feel as if a star is present with the same mass. If an observer 
is observing a star undergoing gravitational collapse, he would keep 
seeing it. The light ray from the star would be red shifted and hence 
the star would look steadily fainter until it completely fades out of 
the observer’s view. The observer would merely think that the star 
is still there but is completely burnt out. 

That gravitational collapse happens can be seen from the fol- 
lowing simple argument. First of all collapse occurs in burnt out 
stars. This is because, when a star is burning, there is thermal mo- 
tion which gives rise to the pressure needed to prevent collapse. If the 
star is burnt out, then the only pressure to balance collapse comes 
from the degeneracy pressure from the Pauli exclusion principle due 
to the presence of fermions. Thus for example, if the star contains n 
fermions of mass m per unit volume, then each fermion on an average 
would occupy a volume of space n~!. Therefore, if we assume that 


pw Ap, (10.47) 


then its momentum on an average can be estimated to be 


p~ Ap. —=hns. (10.48) 


As a result the average velocity of non-relativistic fermions can be 
obtained as 


u~ = = hn3m7!, (10.49) 


On the other hand, if the fermions are relativistic then we can ap- 
proximate their velocity by the speed of light (c = 1). 
The degeneracy pressure due to fermions is given by 


P=momentum x velocity x number density. (10.50) 


Therefore, if the fermions are nonrelativistic, we have 
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Pyp ~ hn? x in3m x n= Rn3m!, (10.51) 
This shows that in the nonrelativistic case only the electrons would 
contribute significantly to the degeneracy pressure since they are 
the lightest. If the fermions are relativistic, on the other hand, the 


degeneracy pressure is given by 


1 4 
Pa~ hn? x1xn=hn3, (10.52) 


so that all fermion species contribute equally. 

Let us now restrict to the case where the fermions are relativis- 
tic. Then, for stability, the degeneracy pressure has to balance the 
attractive gravitational pressure given by (we assume Gy = 1 in this 
discussion) 


M2 
Perav — “GA? (10.53) 
where the mass density can be identified with 
M 
= = nm 
r3 ; 
1 Te 
or, — = (nmM~)3. (10.54) 
r 


Putting this back in (10.53), we see that the gravitational pressure 
is given by 


9 4 4 4 Ce Beare 
Pray = M’n3m3 M~3 =n3m3 M3. (10.55) 


The star would, therefore, be stable only if 


Perav == Pr, 
4: Ae 2 4 
or, n3m3 M3 = hn3 


or,  M=(hm73)? = ham. (10.56) 
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Taking baryons to be the most abundant fermions in stars we can 
calculate the value of this critical mass to be (we should restore all 
the dimensional constants such as Gy and the final number is more 
like 3Mgun) 


M =him-? ~2 Moun, (10.57) 


which shows that a star twice as massive as the sun or more would 
undergo gravitational collapse, if it is completely burnt out. 

We have discussed only the simplest kind of a black hole, namely 
a Schwarzschild black hole. If the star is charged and/or rotating 
then we are also led to other kinds of black holes which we do not 
go into here. 
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CHAPTER 11 
Cosmological models and the big bang 
theory 


11.1 Homogeneity and isotropy 


In building a model of our universe, a lot of experimental observations 
go into the basic assumptions. Quite important among them is the 
observation that our universe is quite homogeneous and isotropic on 
a length scale of 10’ parsecs. (1 parsec is about 3.2 light years.) 
Thus, if we ignore the inhomogeneities and nonisotropic nature of 
the universe on smaller scales, the line element must reflect this. If 
we denote the line element as 


dp = dP ds", (11.1) 


then the three dimensional line element ds? must exhibit homogene- 
ity and isotropy. 

For a homogeneous and isotropic space, it is easy to show that 
the curvature tensor can be written in the form (see (8.76)) 


Ruvrp =k (Gur9up a IpupIvr)+ (11.2) 


This follows because if we choose a flat coordinate system at any 
point, then the metric will become the Minkowski metric. Further- 
more, if we want isotropy, the curvature should be independent of 
rotations at that point. Only the Minkowski metric is invariant un- 
der rotations. Thus, the curvature tensor can be written in terms 
of the Minkowski metric combinations at that point. Furthermore, 
the requirement of the appropriate antisymmetry properties would 
then, select out the curvature tensor to have the above form. It also 
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follows from the definition of homogeneity that the quantity kK, must 
be a constant since otherwise its value will distinguish between dif- 
ferent points in this space. (This would also follow from isotropy. 
If K is not a constant, then its gradients can select out a preferred 
direction.) 

Thus, we see that a homogeneous and isotropic space is a space 
of constant curvature. If we now apply this to our three dimensional 
space, we note that 


3 3 
A= =k (o0f% Ve go), (11.3) 
where the constant curvature K can depend on the time coordinate 
(since that is not part of the 3-dimensional space). It follows now 
that 
3) _ ,(3)A O. = 3 
RD = GOR) SOK (11.4) 


Choosing the 3-dimensional isotropic line element of the form 


ds? = eX) dr? + r?(d0? + sin? 0d¢?), (11.5) 


we can calculate the nontrivial curvature components which yield 


3) __ X(r) 
Ey ee 
RE ) = = cosec?0RY Jo145 TENG eee (11.6) 
Requiring 
RE) = 2K 9), (11.7) 
we obtain 
/ 
R® = a) o = 2Ke, 


RY) = 1+ 5 rr (r)e—* — e* = 2K rr’. (11.8) 
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This implies that 


1 
1+ 5 rr (r)e—* — e = 2Kr?, 


1 
or, ae x 2Kr? — e* = 2K r?, 


or, 1—e = Kr’, 
or, @ OO S(1= Kr?), 
or,  e) = (1— Kr?)1. (11.9) 


Thus, the form of the three dimensional line element is deter- 
mined to be (see also (8.88)) 


dr? 


de = 
1- Kr2 


+ r*(d6? + sin? 6d¢”). (11.10) 


If we now define a rescaled variable 


K 
r* = | Ac | rT, k = [K]’ (11.11) 


we can write (the form of the line element below is true even when 


K=0, ae tn = dr?) 


2 1 dy? DBO I Ai go 
ds* = —— tape (dd* + sin* 0d¢*) | , (11.12) 


where k = 1, —1, 0 corresponding to a space of positive, negative 
and zero curvature respectively. Sometimes, this is also written as 
(ignoring the “x” on the radial coordinate in (11.12)) 


2 
ds? = (R(t)? (; —_ + r?(d6? + sin” 6a6?)) : (11.13) 


The four dimensional line element (11.1) correspondingly can be writ- 
ten as 
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dr? = dt? — ds? 


4 
= at? — R(t) (2% + r?(d6? + sin? 6d¢?) J , (11.14) 
1— kr? 


and this is known as the Friedmann-Robertson-Walker line element. 
R(t) is known as the scale factor for reasons that will become clear 
shortly. 


11.2 Different models of the universe 


There are now three distinct cases to consider depending on k = 
+1,0. 


11.2.1 Close universe. Let us assume that k = 1. In this case, we can 
define coordinates 


r=siny, dr = cos xdx, (11.15) 


such that the line element (11.13) has the form 


ds? 


2 2 
R(t) (—- + sin? y(d0? + sin? oas?)) 


= R(t) (dx? + sin? x(d6? + sin? 6d¢”)) . (11.16) 


This indeed describes the three dimensional surface of a sphere of 
radius R(t) which can be checked from the fact that if we define 


x, = Reosy, 

zg = Rsinxsin@cos¢, 

z3 = Rsinxsin@sing, 

4 = Rsinyxcos8, (11.17) 


we have 
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and we can write 


dx? + dz.’ + da’ + dx. = R? (dy? +sin? y(d0? + sin? 0d¢’)) 
ole (11.19) 


Thus, the case of constant positive curvature corresponds to a closed, 
spherical universe. It has a finite volume. 


11.2.2 Flat universe. Let us next assume that k = 0. In this case, we 
can define the coordinates 


x; = Rrsinécos¢, 
x2 = RrsinOsind, 
x3 = Rrcos@, (11.20) 


which gives (see (11.13)) 


ds? = R? (dr? + r?(d6? + sin” 6d¢)) 
= da? + dz? + da;?. (11.21) 
This is, of course, the three dimensional flat space which is infinite 


in volume and hence the case k = 0 is also said to correspond to an 
open universe. 


11.2.3 Open universe. Finally, we consider the case k = —1. In this 
case, we can define the coordinates 


r =sinhy, dr = cosh xdx, (11.22) 


so that the three dimensional line element (11.13) becomes 
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dss = R (= xdx? 


© + sinh? y(d6? + sin? oa6?)) 
cosh* x 


= R? (dx? +sinh’ y(d6? + sin? 6d¢”)) . (11.23) 


It is easy to check that if we define 


x1; = Rcoshy, 

zg = Rsinhysin@cos 4, 

z3 = Rsinhysin@sing, 

x4 = Rsinhycos@, (11.24) 
then, 

te? =a) =o =a HRW, (11.25) 


and we can write 


ds? = —da,? + dap” + da,’ + da/. (11.26) 


This is what defines a three dimensional surface of a pseudosphere 
whose volume is infinite and, therefore, represents an open universe. 
However, this is different from the flat universe in that it has non- 
trivial curvature. 


11.3 Hubble’s law 


Let us now restrict only to the cases k = +1, the flat universe can be 
obtained in the limit R — oo. We can write the line element (11.21) 
as 


k=1: dr? =dt? — R(t) (dy? +sin? y(d6? + sin? 0d¢’)) , 


k=-1: dr? =dt? — R(t) (dx? + sinh? y(d6? + sin? 0d¢”)) 2 
(11.27) 
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The coordinates y, 9, ¢ are known as co-moving coordinates and 
are useful in the study of various cosmological phenomena. As the 
galaxies expand, they carry with them these coordinates and the dis- 
tance between galaxies change with time not because the coordinates 
change with time — rather because the metric changes with time. 

If we look at the radial 3-dimensional distance between two 
galaxies, then it has the form 


(t) = pu = fx R(t) = Rit) f ax — Roy. (11.28) 


where R(t) is called the scale factor since it shows how distances scale 
with time. From this, we obtain 


det) _ AR® __1_ ARH) py, 
dt di RQ) dt 
- RO py (11.29) 


where a dot denotes a derivative with respect to t. This shows that 
if the galaxies are moving away from one another, then the velocity 
of recession is proportional to their distance, namely, 


oe wo = = ((t) = He(t). (11.30) 


This is one form of Hubble’s law and identifies 


A(t)= ae = Hubble’s constant. (11.31) 


As we have noted, R(t) is known as the scale factor. Hubble’s con- 
stant (it is actually time dependent) is very crucial in cosmological 
studies and it is worth discussing how it is determined. 


Let us consider the radial motion for either & = +1. Then, from 
(11.27) we have 
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dr? dt? — R?(t)dy? 


R?(n)(dn? — dx”), (11.32) 


where we have defined 


dt = Rdn. (11.33) 


Propagation of light in this case, would correspond to 


dn? — dx? = 0. (11.34) 


The equation for the world line of the light signal in this case will be 
of the form 


x(7) = — No- (11.35) 


If two light signals are emitted (from the same point) at times 
No and no + Ano, then the two world lines would be described by 


xX 7) — 10; 
xX = n-no — Ano. (11.36) 


Thus, the two signals will arrive at a given coordinate x with a time 
difference 


An(x) = Ano = constant. (11.37) 


Namely, it will be a constant interval independent of the coordinate. 
The proper time interval, on the other hand, is given by (see (11.32)) 


Ar= RAN; 
or, Ru) = ne = constant, 


or,  R(n)v(y) = constant, (11.38) 
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where v is the frequency of the light wave at the coordinate yx. 

In an expanding universe R increases and, therefore, v must 
decrease inversely and this gives rise to a redshift in the signal. Since 
the product in (11.38) is a constant, we can write 


or, v(x) =v(0) 0) ——_——. (11.39) 


If y < 1, that is, if the separation of the galaxies is not very large, 
we can Taylor expand the right hand side in (11.39) and write 


v(x) = (0) a (2 ay ) 


dn 
v(x)— (0) | 1 A R(n) 
or, =a = = Re a x: (11.40) 


Using the defining relation (11.33) 


dt = Rdn, (11.41) 
we have 
dn _ p-l 
ae 
v(x) — v(0) dn dR 
a : 11.42 
(0) didn ee) 


~ Ry. (11.43) 


This, of course, gives the redshift and from the form of the 
Doppler shift formula (6.65) we note that 
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R 
XO) v=hx= 5 Rx £, ( ) 


giving us once again Hubble’s law (11.30). This shows that Hubble’s 
constant can be determined primarily from a study of the redshift of 
the galaxies and has the present value (from WMAP) of 


H ~ (7.04 + 0.15)cm/sec/parsec. (11.45) 


(There are further uncertainties associated with it. The error quoted 
here is only from random errors.) The inverse of the Hubble constant 
has the dimensions of time and defines a time scale which is normally 
identified with the age of the universe. The present value of the 
Hubble constant leads to an age of the universe 


Tiniverss = H-* 213.8 X10? yrs 2 435: % 10" sec. (11.46) 


11.4 Evolution equation 
Let us look at the Friedmann-Robertson- Walker line element (11.21) 


which has the form 


dr? 
1— kr? 


dr? = dt? — R(t) ( + r?(d6? + sin? oao?)) ; (11.47) 


The Lagrangian for the geodesic can be obtained from this to have 
the form (see (5.127)) 


wl R? 


1 
a 
2 2 1—kr2 


1 . 1 . 
pS 5 R776? — 5 R?r? sin? 667, 
(11.48) 
where a dot denotes a derivative with respect to 7. We can now 


obtain the geodesic equations from the Euler-Lagrange equations (see 
(5.130) and (5.131)). The t equation gives 
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te dR 1 32 2/2 2: 2712 
— | —> = 11.4 
t+R ai (<= +r°O +r°sin® 0¢ 0, (11.49) 
which determines the nontrivial components of Pie to be of the form 
(see (4.49) or (5.131)) 


dR 1 
0 — — 
Bis Ee ag. qecaeeae 
dR 
PeoRk “ r? sin? 0. (11.50) 


My => a Si (11.51) 


The r equation, leads to 


A 5 t op fk 1 3p 2kRr 2 
— —_______ Tr — —__-—-—/_——— es 
1 — kr? dt 1—kr? (1 — kr?)? 
2 . ; 
_ Es i? 4 R?r6? + Rr sin? 067 = 0, (11.52) 
—kr 
9) : ; , 
or, 7+ = tr + —a ¢ — (1 — kr?) G — sin? 06°) = 0. 
This determines 
1 dR 
Ta = Ta= Rad’ 
kr 
1 = 
nt = 1 — kr?’ 


Tyg = r(l—Fr’), 


Tf, = r(1—kr?)sin? @. (11.53) 
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Similarly, the 0 equation gives 


— R?r?6 — 2Rr? “ if — 2R?rr6 + Rr? sincos 0¢7 = 0, 
2 dR Bide. re i 
or, Oe ae io += RB r@ — sin @ cos 0¢* = 0, (11.54) 


which determines 


1 dR 
rm = = — 
1 
rm, = 12,=-+ 
12 21 rm 
T2, = sin@cosé. (11.55) 


Finally, the @ equation leads to 


— R?r? sin? 6¢ — 2Rr? sin? 6 - id — 2R? sin? Ord 
— 2R?r? sin 6 cos 66¢ = 0, 


2 
or, b+ R eu id oe = ae + 2cot 66¢ = 0, (11.56) 


and this determines 


1 dR 
T53 = R= eae 
1 
ry a Bie 
13 31 y? 
Ts, = T3,=—coté. (11.57) 


These are all the nonvanishing components of the connection. 
Given these we can calculate the components of curvature tensor 
which are defined as 


Rw = OTR, — OPA, +1408, — TAP 2s. (11.58) 


11.4 EVOLUTION EQUATION 
This leads to 


Roo = OM 49 — AL Go +T RV G0 — Toe Gy 


= 09 Tio + T'So + To) —Tolin — PoelGe — T3303 


3 @ (LER) 1 dR\? 
dt \R dt R dt 


Ry = OT}, — OFY +1T4F% -TLTQ 
+ (Tio +13 + To) ry + (Ti +13, + 13) My 


—THT io — Tio +P — PH — Tiel fe —Tislis 


d dR dR\? 
5 (eZ) +() 42k 


1 dR\? d2R 
= eee aca re oR ean), 


Rog = OV Xy — OP 3) +1450 S — 13,73 


2 1 


p21 — kr? 


= 0213, — OT 3y — O11 d_ + (Tip + V'Sq 4 To) Tb 
+ (Ti +13, + 13) P39 — DoaT'S0 —T 03, — Tool 32 


2 pl 3 p3 
_ PoP 99 _ P53P53 


d /, dR dR\? 
= 2 2a dit) ry ape? 2( at 
= cosec*) + r 3 (2 7) (1 — 3kr*) +r (=) 


kr 1 
l1—kr2 or 


dR\? d2R 
ee oy pase —— 49k 
e ( (F) ce a rp at ) 


R33 = sls — AT 33 +14, 133 — T3003, 


1 
+ r(1— kr?) ( ) ie r(1 — kr?) — cot? 6 
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= — 00133 — 01133 — 02133 + (Tio +19 + T'3o) 133 
Fe (Ti +13, + T3,) 133 + P3e0'33 — F330 30 
— 13303, — P3309 — 39033 — P31 133 — T3203 


=r? sin? ge (x =) — sin? 6(1 — 3kr) — (cos? 6 — sin? @) 


dt dt 
2 
+ r*sin? 6 (S) —k]| + cot @sin@ cos 6 
2 2 
=r’ sin? 6 (2 (=) +R a - 2] ; (11.59) 


Bde 
f= Ra 
1 dR\? d?R 
Ry =->z 2 ap tla ek Gig (11.60) 


We can now calculate the scalar curvature, 


R = g Roo + 9 Ri; 
3 2R 3 dR\? d2R 
R dt2 R2 


1 /dR\? 1@R_ k 
= -6 (a ($) a ae): (11.61) 


Therefore, we obtain the tt (00) component of Einstein’s equation to 
be 


_ 6. -faR\* _ 6 CR 6k 
R? \ dt 
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1 
Goo = Roo — 5 GR = 84GToo, 


3@R 1 1 /dR\? 1@R &k 
Or (=O) Ge (F) ‘Rae te = 87GT 0, 


SS Tig. (11.62) 


This equation will determine the evolution of the scale factor de- 
pending on the value of & and the other components of Einstein’s 
equation do not give any additional information. 

To determine the evolution of R(t), we need to assume the form 
of T¥” for the matter distribution of the universe. We assume that 
the universe consists of a gas of particles (galaxies) and on a large 
scale the velocity of this gas is the same as the velocity of expan- 
sion. (Namely, there is no appreciable root mean square deviation 
of velocities.) Thus, it is appropriate to neglect the contribution of 
pressure to the stress tensor T’” of the system. In that case, we can 
write 


T° = Too = p, (11.63) 


where p is the density of matter (energy) in our universe. Models 
where pressure is neglected (and also the cosmological constant is 
neglected) are known as the Friedmann models (universes) and the 
equation in this case becomes 


(11.64) 


A (aR)? | k _ 8G 
R? \ dt = 


Let us now study the three cases separately. 


11.4.1 k=1. The matter density varies inversely as the volume so 
that we can write (the unfamiliar factor in the volume comes from a 
nontrivial metric) 
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wee (11.65) 
0 O72 RS’ : 
where, we can think of M as the total mass of the universe. In this 
case, equation (11.64) has the form (for k = 1) 


eae | a apenas (11.66) 


1 /dR\* 1 4GM 
R237 R3" 


If we define, as before, (see (11.33)) 


i 
dt = Rdn, a => (11.67) 


then equation (11.66) can be rewritten in the form 


1= — 11. 
1 3 (11.68) 


The solution of this equation is quite simple. 


R(n) = R.(1— cos), 


2GM 
fe 30” 
t(n) = R,(n—sinn). (11.69) 


As we had noted earlier (see subsection 11.2.1), the case k = 1 
corresponds to a closed universe and the evolution of the universe 
(scale factor) obtained in (11.69) is shown in Fig 11.1. 

This is what is known as a periodic universe. Here the universe 
expands and then contracts eventually to a point and then starts all 
over again with a period of 27R,. 
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® Pin 
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Figure 11.1: For k = 1 the universe is closed and the evolution 


periodic. 


11.4.2 k=0. In this case, equation (11.64) has the form 


R2 t 37 R3’ 
dR 4GM 1 1 
or, = — = V2R, =, 
dt 37 R2 R 
1 
3 
ee. Rape (=) 3. (11.70) 


t(n) = 0. (11.71) 


As we have seen, k = 0 corresponds to the flat universe and as is clear 
from (11.70), in this case, the universe expands forever as shown in 
Fig. 11.2. 
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Figure 11.2: For k = 0 the universe is flat and open. 


11.4.3 k=-1. In this case, the evolution equation (11.64) takes the 
form 


1 /dR\? 1  4GM 

= |(—) -=s> = -——. 11.72 
R? ( dt ) R237 R3 ( ) 
Here the volume of the space is infinite and hence we cannot identify 
M with the total mass of the universe. Rather, we should think of 
this as the total mass contained in a volume 27?R?. The solution is 
again obtained simply by going to the 7-variable (11.33) 


dt = Rdn, 
dn 1 
—- = =. 11.73 
dt R ( ) 
The equation, in this variable, becomes 

1 dR\? 4GM 

2 ee) Sp 11.74 
(5 =) 3k’ Le 


whose solution is 
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2GM 
RG) -= < (cosh n — 1) = R,(cosh n — 1), 
t(n) = R,(sinhn— 7). (11.75) 


Once again, here the universe expands forever and as t > oo, the 
universe becomes flat as shown in Fig. 11.38. We note that for 
smaller values of 7 (or time t), the solutions (11.75) reduce to (11.71). 
Namely, initially the open universe starts out as the flat universe. 
However, for larger times (7 large) the evolution of the open uni- 
verse is linear 


Rt, (11.76) 


which is different from the flat universe. 
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Figure 11.3: For k = —1 the universe is open. 


11.5 Big bang theory and blackbody radiation 


It is clear from the forms of the solutions in the previous section 
that our universe started out at some instant of time from a very 
small region and is expanding since then. Even in the case of the 
periodic universe, we are presently in an expanding epoch. One 
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postulates that the univese started with a big bang from a small, 
densely packed, hot region and has been expanding ever since and 
the expansion is assumed to be adiabatic. This is known as the big 
bang theory due originally to Gamow. 

If the big bang theory is true, then there must be some relic of 
this in the background photon radiation. Namely, throughout the 
evolution of the universe the background photons must have been in 
thermal equilibrium with the temperature of the background radia- 
tion falling with the expansion of the universe. A measurement of the 
temperature of the cosmic microwave background (CMB) radiation 
can, therefore, test the idea of the big bang theory if only the relic 
temperature can be calculated theoretically. Therefore, there was 
a lot of interest in such a calculation of the background radiation 
temperature quite early on. A careful calculation of this temper- 
ature involves a detailed analysis of the primordial nucleosynthesis 
and leads to a value of the background temperature to be approx- 
imately 3 K. However, there is also a much simpler (approximate) 
derivation of this temperature due to Gamow which only uses the 
information about the evolution of the universe (avoiding the tech- 
nicalities associated with nucleosynthesis) and leads to a value of 
about 7 K for the temperature of the background radiation. The 
two numbers are in fact quite close and were theoretically derived 
much before the experimental observation of about 3 K for the tem- 
perature of the background radiation. The experimental observation 
of the appropriate relic temperature vindicates the big bang theory 
and in what follows we give the simpler derivation due to Gamow so 
as to avoid getting into the technicalities associated with the analysis 
of primordial nucleosynthesis. However, we would like to emphasize 
that a serious discussion of the background radiation temperature 
must involve the nucleosynthesis analysis. 

In the context of the big bang theory, at very early times the 
temperature of the universe was quite high and consequently it was 
radiation dominated (matter was negligible). We recall from statis- 
tical mechanics that the energy density for radiation in equilibrium 
at a given temperature can be related to its temperature through the 
Stefan-Boltzmann law as 


= A 
Prad = aS (11.77) 
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where 


o = 5.67 x 10-°gm/sec?-K4, (11.78) 
represents the Stefan-Boltzmann constant (constant of proportional- 
ity). The entropy density associated with radiation is given by 

SO T?, (11.79) 


Thus, the total entropy which remains constant in an adiabatic ex- 
pansion has the form 


S = SR? ~ (RT)? = constant, 
or, RT = constant. (11.80) 


In his calculation Gamow assumed that our universe is open 
(k =—1). Therefore, using (11.77) as well as (11.80) in the Einstein 
equation (11.64) for an open universe (k = —1), we obtain for earlier 
times 


1 (ey 1 8nGy 


R2 dt aa R2 a a Prad> 
arve 4 81Gy 4o 6 
or, (=) —T*= 3 x mo T 5 


dT 327GNO 3 
pacicage ney pean 
ae dt 303 


1 
33 4 1 
T(t) = | ————_] t?. 11.81 
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Here Gy denotes the gravitational constant (Newton’s constant) whose 
value is given by 


Gy = 6.6726 x 1078cm?/gm — sec”, (11.82) 
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and we have neglected the J* term since at high temperature (ear- 
lier times) the T° term on the right hand side dominates. (In this 
case, the constant in (11.80) also drops out of (11.81).) We have 
also chosen the negative sign for the square root to signify that the 
temperature of the universe decreases as time increases in an expand- 
ing universe. Equation (11.81) leads to the radiation density in the 
universe at any time as 


4do _4 40 3c3 2 
\ Sa ee G 
Praa(t) fons 3 1287Gyo 
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= ee t~? ~ 4.4722 x 10° t-?gm-sec”/em?. (11.83) 


This shows that the radiation density in the universe decreases with 
time as we would expect. 

Our universe is presently matter dominated and in a matter 
dominated universe the matter density has the form 


1 
Pmatter ™ R3 (11.84) 


Consequently, the Einstein equation (11.64) for a matter dominated 
open universe (k = —1) 


1 /dR\? 1 — 81Gy 8rGy 1 
— {(—) —-— = —— pr —— — 11.85 
Re ( dt ) 2 3 8° Bs Re ene?) 
can be approximated, for large values of R, by 
(any. 1 F 
R2 \ dt pes 
or, R(t)~t. (11.86) 


Thus, in this case, we have 


Pmatter (t) “— cS, 


or, Prnattar (t)t? = constant. (11.87) 
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The constant in (11.87) can be determined as follows. Taking the 
present estimate of the matter density in the universe as well as the 
present age of the universe (see (11.46)) 


Puistter (tp) = 24 x 10 em/cm?, 
ty © 13.73 x 10° yrs ~ 4.35 x 10!” sec, (11.88) 


we obtain 


Pmatter (tpt? ~ 2.4 x (4.35)> x 107° ~ 1.9755 x 10??gm-sec*/em3, 
(11.89) 


which determines the matter density at any earlier time t to be given 
by 


Pmatter(t) © 1.9755 x 1072 t~> gm-sec? /cm?. (11.90) 


Gamow’s idea was to extrapolate the two asymptotic behaviors 
in (11.83) and (11.90) in the following way. (This is clearly very 
simple minded, nonetheless the result is quite impressive.) Namely, 
we know that in the very early times the universe was radiation 
dominated. As time increased the relative density of radiation and 
matter decreased and presently we are in a matter dominated uni- 
verse. Therefore, at some instant of time, say t,, during the evolution 
the radiation density must have equalled the matter density. That 
would determine 


pias) = 4A SIP EPH 0b 0M ie = pane), 
(11.91) 


which determines 


__ 1.9755 x 10” 


x = 44722 x 10 sec ~ 4.417 x 10" sec. (11.92) 


Using this in (11.81) we obtain the temperature of the universe at 
that instant to be 
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T(t) = eee ree ee ee 
1287 Gyo do 327Gy a by 


1 
_ f 27% 10 x 44722 % 10°) 4 y 1 
Z 4 x 5.67 x 10-5 V4.417 x 108 


~ T2.27K. (11.93) 


Recalling that in the matter dominated epoch (see (11.86)), 


R(t) wt, (11.94) 


and using (11.80) we obtain 


T (tp) te 

TG). 2b! 

T (tp) = T(t,) ft ~72.97K x AY 3K. (11.98) 
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This very crude calculation shows that the temperature of the back- 
ground blackbody radiation in the universe at present must be about 
7 K which is very close to the observed value of about 3 K. As we 
have emphasized earlier, a careful determination of the background 
radiation temperature involves a careful analysis of the primordial 
nucleosynthesis in the universe and leads to a value of about 3 K 
consistent with the observed value. This marks a significant confir- 
mation of the big bang theory. 
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